THERAPEUTIC USES OF GENOME EDITING WITH CRISPR/Cas SYSTEMS

ABSTRACT

Disclosed herein are methods, compositions, and kits for high efficiency, site-specific genomic editing of cells for treating or preventing genetic blood disorders.

RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 16/278,645, filed Feb. 18, 2019, which is a continuation of U.S. application Ser. No. 14/509,924, filed Oct. 8, 2014 (now U.S. Pat. No. 10,208,319, issued Feb. 19, 2019), which is a continuation-in-part of PCT Application No. PCT/US2014/46034, filed Jul. 9, 2014, which claims the benefit of U.S. Provisional Application No. 61/844,333, filed on Jul. 9, 2013, and 61/869,369, filed on Aug. 23, 2013. The entire teachings of the above applications are incorporated herein by reference.

GOVERNMENT SUPPORT

This invention was made with government support under HL118744, HL098364, DK095384 and HL107440 awarded by the National Institutes of Health. The government has certain rights in the invention.

BACKGROUND OF THE INVENTION

Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) systems are a new class of genome-editing tools that target desired genomic sites in mammalian cells. Recently published type II CRISPR/Cas systems use Cas9 nuclease that is targeted to a genomic site by complexing with a synthetic guide RNA that hybridizes to a 20-nucleotide DNA sequence and immediately preceding an NGG motif recognized by Cas9 (thus, a (N)₂₀NGG target DNA sequence). This results in a double-strand break three nucleotides upstream of the NGG motif. The double strand break instigates either non-homologous end-joining, which is error-prone and conducive to frameshift mutations that knock out gene alleles, or homology-directed repair, which can be exploited with the use of an exogenously introduced double-strand or single-strand DNA repair template to knock in or correct a mutation in the genome. Thus, CRISPR/Cas systems could be useful tools for therapeutic applications, but unfortunately prior published reports have demonstrated an efficiency of allele targeting of only 2%-4% in human stem cells (Mali et al., Science 339:823-826 (2013)).

SUMMARY OF THE INVENTION

Work described herein demonstrates methods of allele targeting using CRISPR/Cas systems resulting in mutant cells with efficiencies of up to 80%. These vastly improved methods permit CRISPR/Cas systems to be utilized effectively for the first time for therapeutic purposes. Methods of delivery of CRISPR/Cas systems to human stem cells are provided. In addition, methods of specifically identifying useful RNA guide sequences are provided, along with particular guide sequences useful in targeting specific genes (e.g., ADA, AK2, CD3D, DCLRE1C, IL2RG, IL7R, JAK3, LIG4, NHEJ1, PNP, PRKDC, RAG1, RAG2, ZAP70 and HBB). Moreover, methods of treatment (e.g., severe combined immunodeficiency, sickle cell disease, e.g., sickle cell anemia, beta thalassemia, etc.) utilizing the compositions and methods disclosed herein are provided. In some aspects, disclosed herein is a method for altering a target severe combined immunodeficiency (SCID)-associated polynucleotide sequence in a cell comprising contacting the SCID-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, wherein the target SCID-associated polynucleotide sequence is cleaved.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of a SCID-associated polynucleotide sequence in a subject, the method comprising (a) altering a target SCID-associated polynucleotide sequence in a cell ex vivo by contacting the SCID-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, wherein the target SCID-associated polynucleotide sequence is cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequence.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of a SCID-associated polynucleotide sequence in a subject, the method comprising altering a target SCID-associated polynucleotide sequence in a cell by contacting the SCID-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, and wherein the target SCID-associated polynucleotide sequence is cleaved, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequence.

In some aspects, disclosed herein is a method for simultaneously altering multiple target SCID-associated polynucleotide sequences in a cell comprising contacting the SCID-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCID-associated polynucleotide sequences, wherein the target SCID-associated polynucleotide sequences are cleaved.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of SCID-associated polynucleotide sequences in a subject, the method comprising (a) altering target SCID-associated polynucleotide sequences in a cell ex vivo by contacting the SCID-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCID-associated polynucleotide sequences, wherein the target SCID-associated polynucleotide sequences are cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequences.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of SCID-associated polynucleotide sequences in a subject, the method comprising altering target SCID-associated polynucleotide sequences in a cell by contacting the SCID-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target moieties of the target SCID-associated polynucleotide sequences, and wherein the target SCID-associated polynucleotide sequences are cleaved, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequences.

In some aspects, disclosed herein is a method for altering a target sickle cell disease (SCD)-associated polynucleotide sequence in a cell comprising contacting the SCD-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCD-associated polynucleotide sequence, wherein the target SCD-associated polynucleotide sequence is cleaved.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of a SCD-associated polynucleotide sequence in a subject, the method comprising (a) altering a target SCD-associated polynucleotide sequence in a cell ex vivo by contacting the SCD-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCD-associated polynucleotide sequence, wherein the target SCD-associated polynucleotide sequence is cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequence.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of a SCD-associated polynucleotide sequence in a subject, the method comprising altering a target SCD-associated polynucleotide sequence in a cell by contacting the SCD-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCD-associated polynucleotide sequence, and wherein the target SCD-associated polynucleotide sequence is cleaved, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequence.

In some aspects, disclosed herein is a method for simultaneously altering multiple target SCD-associated polynucleotide sequences in a cell comprising contacting the SCD-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCD-associated polynucleotide sequences, wherein the target SCD-associated polynucleotide sequences are cleaved.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of SCD-associated polynucleotide sequences in a subject, the method comprising (a) altering target SCD-associated polynucleotide sequences in a cell ex vivo by contacting the SCD-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCD-associated polynucleotide sequences, wherein the target SCD-associated polynucleotide sequences are cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequences.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of SCD-associated polynucleotide sequences in a subject, the method comprising altering target SCD-associated polynucleotide sequences in a cell by contacting the SCD-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target moieties of the target SCD-associated polynucleotide sequences, and wherein the target SCD-associated polynucleotide sequences are cleaved, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequences.

In some aspects, disclosed herein is a method for altering a target beta thalassemia-associated polynucleotide sequence in a cell comprising contacting the beta thalassemia-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target beta thalassemia-associated polynucleotide sequence, wherein the target beta thalassemia-associated polynucleotide sequence is cleaved.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of a beta thalassemia-associated polynucleotide sequence in a subject, the method comprising (a) altering a target beta thalassemia-associated polynucleotide sequence in a cell ex vivo by contacting the beta thalassemia-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target beta thalassemia-associated polynucleotide sequence, wherein the target beta thalassemia-associated polynucleotide sequence is cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequence.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of a beta thalassemia-associated polynucleotide sequence in a subject, the method comprising altering a target beta thalassemia-associated polynucleotide sequence in a cell by contacting the beta thalassemia-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target beta thalassemia-associated polynucleotide sequence, and wherein the target beta thalassemia-associated polynucleotide sequence is cleaved, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequence.

In some aspects, disclosed herein is a method for simultaneously altering multiple target beta thalassemia-associated polynucleotide sequences in a cell comprising contacting the beta thalassemia-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target beta thalassemia-associated polynucleotide sequences, wherein the target beta thalassemia-associated polynucleotide sequences are cleaved.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of beta thalassemia-associated polynucleotide sequences in a subject, the method comprising (a) altering target beta thalassemia-associated polynucleotide sequences in a cell ex vivo by contacting the beta thalassemia-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target beta thalassemia-associated polynucleotide sequences, wherein the target beta thalassemia-associated polynucleotide sequences are cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequences.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of beta thalassemia-associated polynucleotide sequences in a subject, the method comprising altering target beta thalassemia-associated polynucleotide sequences in a cell by contacting the beta thalassemia-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target moieties of the target beta thalassemia-associated polynucleotide sequences, and wherein the target beta thalassemia-associated polynucleotide sequences are cleaved, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequences.

In some embodiments, the Cas protein is Streptococcus pyogenes Cas9 protein or a functional portion thereof. In some embodiments, the functional portion comprises a combination of operably linked Cas9 protein functional domains selected from the group consisting of a DNA binding domain, at least one RNA binding domain, a helicase domain, and an endonuclease domain. In some embodiments, the functional domains form a complex. In some embodiments, the Cas protein is Cas9 protein from any bacterial species or functional portion thereof. In some embodiments, the functional portion comprises a combination of operably linked Cas9 protein functional domains selected from the group consisting of a DNA binding domain, at least one RNA binding domain, a helicase domain, and an endonuclease domain. In some embodiments, the functional domains form a complex.

In some embodiments, the Cas protein is complexed with the one to two ribonucleic acids. In some embodiments, the Cas protein is complexed with the multiple ribonucleic acids.

In some embodiments, the target motif is a 20-nucleotide DNA sequence. In some embodiments, each target motif is a 20-nucleotide DNA sequence. In some embodiments, the target motif is a 20-nucleotide DNA sequence beginning with G and immediately precedes an NGG motif recognized by the Cas protein. In some embodiments, each target motif is a 20-nucleotide DNA sequence beginning with G and immediately precedes an NGG motif recognized by the Cas protein. In some embodiments, the target motif is a 20-nucleotide DNA sequence and immediately precedes an NGG motif recognized by the Cas protein. In some embodiments, each target motif is a 20-nucleotide DNA sequence and immediately precedes an NGG motif recognized by the Cas protein. In some embodiments, the target motif is G(N)₁₉NGG. In some embodiments, each target motif is G(N)₁₉NGG. In some embodiments, the target motif is (N)₂₀NGG. In some embodiments, each target motif is (N)₂₀NGG.

In some embodiments, the target polynucleotide sequence is cleaved such that a double-strand break results. In some embodiments, each target polynucleotide sequence is cleaved such that a double-strand break results. In some embodiments, the target polynucleotide sequence is cleaved such that a single-strand break results. In some embodiments, each target polynucleotide sequence is cleaved such that a single-strand break results.

In some embodiments, the alteration is an indel. In some embodiments, the alteration results in reduced expression of the target polynucleotide sequence. In some embodiments, the alteration results in reduced expression of the target polynucleotide sequences. In some embodiments, the alteration results in a knock out of the target polynucleotide sequence. In some embodiments, the alteration results in a knock out of the target polynucleotide sequences. In some embodiments, the alteration results in correction of the target polynucleotide sequence from an undesired sequence to a desired sequence. In some embodiments, the alteration results in correction of the target polynucleotide sequences from undesired sequences to desired sequences. In some embodiments, the alteration is a homozygous alteration. In some embodiments, each alteration is a homozygous alteration.

In some embodiments, subsequent to cleavage of the target polynucleotide sequence, homology-directed repair occurs. In some embodiments, homology-directed repair is performed using an exogenously introduced DNA repair template. In some embodiments, the exogenously introduced DNA repair template is single-stranded. In some embodiments, the exogenously introduced DNA repair template is double-stranded. In some embodiments, subsequent to cleavage of the target polynucleotide sequences, homology-directed repair occurs. In some embodiments, homology-directed repair is performed using an exogenously introduced DNA repair template. In some embodiments, the exogenously introduced DNA repair template is single-stranded. In some embodiments, the exogenously introduced DNA repair template is double-stranded.

In some embodiments, the cell is a peripheral blood cell. In some embodiments, the cell is a stem cell or a pluripotent cell. In some embodiments, the cell is a hematopoietic stem cell. In some embodiments, the cell is a CD34⁺ cell. In some embodiments, the cell is a CD34⁺ mobilized peripheral blood cell. In some embodiments, the cell is a CD34⁺ cord blood cell. In some embodiments, the cell is a CD34⁺ bone marrow cell. In some embodiments, the cell is a CD34⁺CD38-Lineage-CD90⁺CD45RA⁻ cell.

In some embodiments, the target polynucleotide sequence is ADA. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is AK2. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is CD3D. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is DCLRE1C. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is IL2RG. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is IL7R. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is JAK3. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is LIG4. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is NHEJ1. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is PNP. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is PRKDC. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is RAG1. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is RAG2. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is ZAP70. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15 or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequence is HBB. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5 or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321) or at least a 12 nucleotide fragment thereof. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to ribonucleic acid sequence GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321) or at least a 12 nucleotide fragment thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of ADA. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of AK2. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of CD3D. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of DCLRE1C. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of IL2RG. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of IL7R. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of JAK3. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of LIG4. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of NHEJ1. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of PNP. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of PRKDC. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of RAG1. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of RAG2. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of ZAP70. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of HBB. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5 or at least 12 nucleotide fragments thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5 or at least 12 nucleotide fragments thereof.

In some embodiments, the target polynucleotide sequences comprise at least a portion of any combination of target polynucleotide sequences selected from the group consisting of ADA, AK2, CD3D, DCLRE1C, IL2RG, IL7R, JAK3, LIG4, NHEJ1, PNP, PRKDC, RAG1, RAG2, and ZAP70. In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIGS. 1-15 or at least a 12 nucleotide fragment thereof. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIGS. 1-15 or at least a 12 nucleotide fragment thereof.

In some embodiments, the disorder is SCID. In some embodiments, the disorder is sickle cell disease. In some embodiments, the disorder is beta thalassemia.

In some embodiments, the one to two ribonucleic acids are designed to hybridize to a target motif immediately adjacent to a deoxyribonucleic acid motif recognized by the Cas protein. In some embodiments, each of the one to two ribonucleic acids are designed to hybridize to target motifs immediately adjacent to deoxyribonucleic acid motifs recognized by the Cas protein which flank a mutant allele located between the target motifs. In some embodiments, the multiple ribonucleic acids are designed to hybridize to target motifs immediately adjacent to deoxyribonucleic acid motifs recognized by the Cas protein. In some embodiments, the multiple ribonucleic acids are designed to hybridize to target motifs immediately adjacent to deoxyribonucleic acid motifs recognized by the Cas protein which flank mutant alleles located between the target motifs. In some embodiments, the one to two ribonucleic acids are selected to minimize hybridization with nucleic acid sequences other than the target polynucleotide sequence. In some embodiments, the multiple ribonucleic acids are selected to minimize hybridization with nucleic acid sequences other than the target polynucleotide sequence. In some embodiments, the target motif is selected such that it contains at least two mismatches when compared with all other genomic nucleotide sequences in the cell. In some embodiments, each target motif is selected such that it contains at least two mismatches when compared with all other genomic nucleotide sequences in the cell. In some embodiments, the target motif is selected such that it contains at least one mismatch when compared with all other genomic nucleotide sequences in the cell. In some embodiments, the target motif is selected such that it contains at least one mismatch when compared with all other genomic nucleotide sequences in the cell. In some embodiments, the one to two ribonucleic acids hybridize to a target motif that it contains at least two mismatches when compared with all other genomic nucleotide sequences in the cell. In some embodiments, each of the multiple ribonucleic acids hybridize to target motifs that contain at least two mismatches when compared with all other genomic nucleotide sequences in the cell. In some embodiments, the one to two ribonucleic acids hybridize to a target motif that contains at least one mismatch when compared with all other genomic nucleotide sequences in the cell. In some embodiments, each of the multiple ribonucleic acids hybridize to target motifs that contain at least one mismatch when compared with all other genomic nucleotide sequences in the cell.

In some embodiments, the Cas protein and the one to two ribonucleic acids are contained in a nanoparticle. In some embodiments, the Cas protein and the one to two ribonucleic acids are contained in a lipid nanoparticle. In some embodiments, the lipid nanoparticle comprises at least one of a cationic lipid, a neutral lipid, an amino lipid, a sterol, and a PEG or PEG-modified lipid. In some embodiments, the cationic lipid is selected from the group consisting of ALNY-100, C12-200, DODAC, DDAB, DOTAP, DOTMA, DODMA, DLinDMA, DLenDMA, DLin-C-DAP, DLin-DAC, DLin-MA, DLinDAP, DLin-S-DMA, DLin-2-DMAP, DLin-TMA.Cl, DLin-TAP.Cl, DLin-MPZ, DLinAP, DOAP, DLin-EG-DMA, DLinDMA, DLin-K-DMA, DLin-KC2-DMA, DLin-M-C3-DMA, KC2, MC3, DOTAP.Cl, DOSPA, DOGS, DOPE, DODAP, DMRIE, XTC, and mixtures thereof. In some embodiments, the neutral lipid is selected from the group consisting of DPSC, DPPC, POPC, DOPE, SM, and mixtures thereof. In some embodiments, the PEG-modified lipid is selected from the group consisting of PEG-DMG, PEG-CerC14, PEG-CerC20, and mixtures thereof. In some embodiments, the Cas protein and the multiple ribonucleic acids are contained in nanoparticles. In some embodiments, the Cas protein and the multiple ribonucleic acids are contained in lipid nanoparticles. In some embodiments, the lipid nanoparticles comprise at least one of a cationic lipid, a neutral lipid, an amino lipid, a sterol, and a PEG or PEG-modified lipid. In some embodiments, the cationic lipid is selected from the group consisting of ALNY-100, C12-200, DODAC, DDAB, DOTAP, DOTMA, DODMA, DLinDMA, DLenDMA, DLin-C-DAP, DLin-DAC, DLin-MA, DLinDAP, DLin-S-DMA, DLin-2-DMAP, DLin-TMA.Cl, DLin-TAP.Cl, DLin-MPZ, DLinAP, DOAP, DLin-EG-DMA, DLinDMA, DLin-K-DMA, DLin-KC2-DMA, DLin-M-C3-DMA, KC2, MC3, DOTAP.Cl, DOSPA, DOGS, DOPE, DODAP, DMRIE, XTC, and mixtures thereof. In some embodiments, the neutral lipid is selected from the group consisting of DPSC, DPPC, POPC, DOPE, SM, and mixtures thereof. In some embodiments, the PEG-modified lipid is selected from the group consisting of PEG-DMG, PEG-CerC14, PEG-CerC20, and mixtures thereof.

In some embodiments, the efficiency of alteration at each loci is from about 50% to about 80%. In some embodiments, the efficiency of alteration is at least about 5%. In some embodiments, the efficiency of alteration is at least about 10%. In some embodiments, the efficiency of alteration is from about 50% to about 80%.

In some embodiments, the Cas protein is encoded by a modified nucleic acid. In some embodiments, the modified nucleic acid comprises a ribonucleic acid containing at least one modified nucleotide selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate. In some embodiments, at least one of the ribonucleic acids is a modified ribonucleic acid comprising one to two modified nucleotides selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate.

In some embodiments, any of the Cas protein or the ribonucleic acids are expressed from a plasmid. In some embodiments, any of the Cas protein or the ribonucleic acids are expressed using a promoter optimized for increased expression in stem cells. In some embodiments, the promoter is selected from the group consisting of a Cytomegalovirus (CMV) early enhancer element and a chicken beta-actin promoter, a chicken beta-actin promoter, an elongation factor-1 alpha promoter, and a ubiquitin promoter.

In some embodiments, the method further comprises selecting cells that express the Cas protein. In some embodiments, selecting cells comprises FACS. In some embodiments, FACs is used to select cells which co-express Cas and a fluorescent protein selected from the group consisting of green fluorescent protein and red fluorescent protein.

In some aspects, disclosed herein is a method for altering a target SCID-associated polynucleotide sequence in a cell comprising contacting the SCID-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, wherein the target SCID-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some aspects, disclosed herein is a method for altering a target SCD-associated polynucleotide sequence in a cell comprising contacting the SCD-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCD-associated polynucleotide sequence, wherein the target SCD-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some aspects, disclosed herein is a method for altering a target beta thalassemia-associated polynucleotide sequence in a cell comprising contacting the beta thalassemia-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target beta thalassemia-associated polynucleotide sequence, wherein the target beta thalassemia-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of a SCID-associated polynucleotide sequence in a subject, the method comprising (a) altering a target SCID-associated polynucleotide sequence in a cell ex vivo by contacting the SCID-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, wherein the target SCID-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequence.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of a SCD-associated polynucleotide sequence in a subject, the method comprising (a) altering a target SCD-associated polynucleotide sequence in a cell ex vivo by contacting the SCD-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCD-associated polynucleotide sequence, wherein the target SCD-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequence.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of a beta thalassemia-associated polynucleotide sequence in a subject, the method comprising (a) altering a target beta thalassemia-associated polynucleotide sequence in a cell ex vivo by contacting the beta thalassemia-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target beta thalassemia-associated polynucleotide sequence, wherein the target beta thalassemia-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequence.

In some aspects, disclosed herein is a method for simultaneously altering multiple target SCID-associated polynucleotide sequences in a cell comprising contacting the SCID-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCID-associated polynucleotide sequences, wherein the target SCID-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some aspects, disclosed herein is a method for simultaneously altering multiple target SCD-associated polynucleotide sequences in a cell comprising contacting the SCD-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCD-associated polynucleotide sequences, wherein the target SCD-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some aspects, disclosed herein is a method for simultaneously altering multiple target beta thalassemia-associated polynucleotide sequences in a cell comprising contacting the beta thalassemia-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target beta thalassemia-associated polynucleotide sequences, wherein the target beta thalassemia-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of SCID-associated polynucleotide sequences in a subject, the method comprising (a) altering target SCID-associated polynucleotide sequences in a cell ex vivo by contacting the SCID-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCID-associated polynucleotide sequences, wherein the target SCID-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequences.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of SCD-associated polynucleotide sequences in a subject, the method comprising (a) altering target SCD-associated polynucleotide sequences in a cell ex vivo by contacting the SCD-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCD-associated polynucleotide sequences, wherein the target SCD-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequences.

In some aspects, disclosed herein is a method for treating or preventing a disorder associated with expression of beta thalassemia-associated polynucleotide sequences in a subject, the method comprising (a) altering target beta thalassemia-associated polynucleotide sequences in a cell ex vivo by contacting the beta thalassemia-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCID-associated polynucleotide sequences, wherein the target beta thalassemia-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequences.

In some aspects, disclosed herein is a composition, comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIGS. 1-15 or at least a 12 nucleotide fragment thereof.

In some aspects, disclosed herein is a composition, comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIGS. 1-15 or at least a 12 nucleotide fragment thereof.

In some embodiments, the at least one ribonucleic acid is contained in a nanoparticle. In some embodiments, the at least one ribonucleic acid is contained in a lipid nanoparticle. In some embodiments, the lipid nanoparticle comprises at least one of a cationic lipid, a neutral lipid, an amino lipid, a sterol, and a PEG or PEG-modified lipid. In some embodiments, the cationic lipid is selected from the group consisting of ALNY-100, C12-200, DODAC, DDAB, DOTAP, DOTMA, DODMA, DLinDMA, DLenDMA, DLin-C-DAP, DLin-DAC, DLin-MA, DLinDAP, DLin-S-DMA, DLin-2-DMAP, DLin-TMA.Cl, DLin-TAP.Cl, DLin-MPZ, DLinAP, DOAP, DLin-EG-DMA, DLinDMA, DLin-K-DMA, DLin-KC2-DMA, DLin-M-C3-DMA, KC2, MC3, DOTAP.Cl, DOSPA, DOGS, DOPE, DODAP, DMRIE, XTC, and mixtures thereof. In some embodiments, the neutral lipid is selected from the group consisting of DPSC, DPPC, POPC, DOPE, SM, and mixtures thereof. In some embodiments, the PEG-modified lipid is selected from the group consisting of PEG-DMG, PEG-CerC14, PEG-CerC20, and mixtures thereof. In some embodiments, at least one of the ribonucleic acids is a modified ribonucleic acid comprising one to two modified nucleotides selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate.

In some embodiments, a composition further comprises a nucleic acid sequence encoding a Cas protein.

In some embodiments, a composition further comprises a nucleic acid sequence encoding a Cas9 protein or a functional portion thereof. In some embodiments, the nucleic acid comprises a modified ribonucleic acid comprising at least one modified nucleotide selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate.

In some aspects, disclosed herein is a composition, comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIGS. 1-15 or at least a 12 nucleotide fragment thereof.

In some aspects, disclosed herein is a composition, n comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIGS. 1-15 or at least a 12 nucleotide fragment thereof.

In some embodiments, the composition further comprises a nucleic acid sequence encoding a fluorescent protein selected from the group consisting of green fluorescent protein and red fluorescent protein. In some embodiments, the composition further comprises a promoter operably linked to the chimeric nucleic acid. In some embodiments, the promoter is optimized for increased expression in human stem cells. In some embodiments, the promoter is selected from the group consisting of a Cytomegalovirus (CMV) early enhancer element and a chicken beta-actin promoter, a chicken beta-actin promoter, an elongation factor-1 alpha promoter, and a ubiquitin promoter.

In some embodiments, the chimeric nucleic acid is contained in a nanoparticle. In some embodiments, the chimeric nucleic acid is contained in a lipid nanoparticle. In some embodiments, the lipid nanoparticle comprises at least one of a cationic lipid, a neutral lipid, an amino lipid, a sterol, and a PEG or PEG-modified lipid. In some embodiments, the cationic lipid is selected from the group consisting of ALNY-100, C12-200, DODAC, DDAB, DOTAP, DOTMA, DODMA, DLinDMA, DLenDMA, DLin-C-DAP, DLin-DAC, DLin-MA, DLinDAP, DLin-S-DMA, DLin-2-DMAP, DLin-TMA.Cl, DLin-TAP.Cl, DLin-MPZ, DLinAP, DOAP, DLin-EG-DMA, DLinDMA, DLin-K-DMA, DLin-KC2-DMA, DLin-M-C3-DMA, KC2, MC3, DOTAP.Cl, DOSPA, DOGS, DOPE, DODAP, DMRIE, XTC, and mixtures thereof. In some embodiments, the neutral lipid is selected from the group consisting of DPSC, DPPC, POPC, DOPE, SM, and mixtures thereof. In some embodiments, the PEG-modified lipid is selected from the group consisting of PEG-DMG, PEG-CerC14, PEG-CerC20, and mixtures thereof. In some embodiments, the chimeric nucleic acid comprises at least one modified nucleotide selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate.

In some embodiments, the Cas protein comprises a Cas9 protein or a functional portion thereof.

In some aspects, disclosed herein is a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIGS. 1-15, a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIGS. 1-15 or at least a 12 nucleotide fragment thereof.

In some embodiments, the kit further comprises one or more cell lines, cultures, or populations selected from the group consisting of human pluripotent cells, primary human cells, and non-transformed cells. In some embodiments, the kit further comprises a DNA repair template selected from the group consisting of an ADA DNA repair template, a AK2 DNA repair template, a CD3D DNA repair template, a DCLRE1C DNA repair template, a IL2RG DNA repair template, IL7R DNA repair template, a JAK3 DNA repair template, a LIG4 DNA repair template, a NHEJ1 DNA repair template, a PNP DNA repair template, a PRKDC DNA repair template, a RAG1 DNA repair template, a RAG2 DNA repair template, a ZAP70 DNA repair template, and a HBB DNA repair template.

In some aspects, the disclosure provides a composition comprising at least one ribonucleic acid having a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321) or at least a 12 nucleotide fragment thereof. In some aspects, the disclosure provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321) or at least a 12 nucleotide fragment thereof. In some embodiments, the at least one ribonucleic acid is contained in a nanoparticle. In some embodiments, the at least one ribonucleic acid is contained in a lipid nanoparticle. In some embodiments, the lipid nanoparticle comprises at least one of a cationic lipid, a neutral lipid, an amino lipid, a sterol, and a PEG or PEG-modified lipid. In some embodiments, the cationic lipid is selected from the group consisting of ALNY-100, C12-200, DODAC, DDAB, DOTAP, DOTMA, DODMA, DLinDMA, DLenDMA, DLin-C-DAP, DLin-DAC, DLin-MA, DLinDAP, DLin-S-DMA, DLin-2-DMAP, DLin-TMA.Cl, DLin-TAP.Cl, DLin-MPZ, DLinAP, DOAP, DLin-EG-DMA, DLinDMA, DLin-K-DMA, DLin-KC2-DMA, DLin-M-C3-DMA, KC2, MC3, DOTAP.Cl, DOSPA, DOGS, DOPE, DODAP, DMRIE, XTC, and mixtures thereof. In some embodiments, the neutral lipid is selected from the group consisting of DPSC, DPPC, POPC, DOPE, SM, and mixtures thereof. In some embodiments, the PEG-modified lipid is selected from the group consisting of PEG-DMG, PEG-CerC14, PEG-CerC20, and mixtures thereof. In some embodiments, at least one of the ribonucleic acids is a modified ribonucleic acid comprising one to two modified nucleotides selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate. In some embodiments, the composition includes a nucleic acid sequence encoding a Cas protein. In some embodiments, the composition includes a nucleic acid sequence encoding a Cas9 protein or a functional portion thereof. In some embodiments, the nucleic acid comprises a modified ribonucleic acid comprising at least one modified nucleotide selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate.

In some aspects, the disclosure provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a ribonucleic acid sequences of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321) or at least a 12 nucleotide fragment thereof. In some aspects, the disclosure provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321) or at least a 12 nucleotide fragment thereof. In some embodiments, the composition includes a nucleic acid sequence encoding a fluorescent protein selected from the group consisting of green fluorescent protein and red fluorescent protein. In some embodiments, the composition includes a promoter operably linked to the chimeric nucleic acid. In some embodiments, the promoter is optimized for increased expression in human stem cells. In some embodiments, the promoter is selected from the group consisting of a Cytomegalovirus (CMV) early enhancer element and a chicken beta-actin promoter, a chicken beta-actin promoter, an elongation factor-1 alpha promoter, and a ubiquitin promoter. In some embodiments, the chimeric nucleic acid is contained in a nanoparticle. In some embodiments, the chimeric nucleic acid is contained in a lipid nanoparticle. In some embodiments, the lipid nanoparticle comprises at least one of a cationic lipid, a neutral lipid, an amino lipid, a sterol, and a PEG or PEG-modified lipid. In some embodiments, the cationic lipid is selected from the group consisting of ALNY-100, C12-200, DODAC, DDAB, DOTAP, DOTMA, DODMA, DLinDMA, DLenDMA, DLin-C-DAP, DLin-DAC, DLin-MA, DLinDAP, DLin-S-DMA, DLin-2-DMAP, DLin-TMA.Cl, DLin-TAP.Cl, DLin-MPZ, DLinAP, DOAP, DLin-EG-DMA, DLinDMA, DLin-K-DMA, DLin-KC2-DMA, DLin-M-C3-DMA, KC2, MC3, DOTAP.Cl, DOSPA, DOGS, DOPE, DODAP, DMRIE, XTC, and mixtures thereof. In some embodiments, the neutral lipid is selected from the group consisting of DPSC, DPPC, POPC, DOPE, SM, and mixtures thereof. In some embodiments, the PEG-modified lipid is selected from the group consisting of PEG-DMG, PEG-CerC14, PEG-CerC20, and mixtures thereof. In some embodiments, the chimeric nucleic acid comprises at least one modified nucleotide selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate. In some embodiments, the Cas protein comprises a Cas9 protein or a functional portion thereof.

In some aspects, the disclosure provides a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321), a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321) or at least a 12 nucleotide fragment thereof. In some embodiments, the kit includes one or more cell lines, cultures, or populations selected from the group consisting of human pluripotent cells, primary human cells, and non-transformed cells. In some embodiments, the kit includes a HBB DNA repair template.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human ADA.

FIG. 2 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human AK2.

FIG. 3 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human CD3D.

FIG. 4 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human DCLRE1C.

FIG. 5 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human HBB.

FIG. 6 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human IL2RG.

FIG. 7 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human IL7R.

FIG. 8 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human JAK3.

FIG. 9 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human LIG4.

FIG. 10 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human NHEJ1.

FIG. 11 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human PNP.

FIG. 12 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human PRKDC.

FIG. 13 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human RAG1.

FIG. 14 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human RAG2.

FIG. 15 shows exemplary guide RNA sequences useful when the target polynucleotide sequence is human ZAP70.

FIG. 16 shows an exemplary amino acid sequence of a Cas protein. Yellow highlights indicate Ruv-C-like domain. Underlining indicates HNH nuclease domain.

FIGS. 17A, 17B, 17C and 17D demonstrate targeted capture and extremely deep sequencing of on-target and predicted off-target sites in CD34+ HPSCs. FIG. 17A is a schematic overview of targeted capture and deep sequencing of on-target and predicted off-target sites (red bar). A 500 bp flanking cutting site (in yellow) were included in sequence analysis for detection of structural rearrangements, including translocations. Probe sets are indicated in blue. FIG. 17B features plots showing consistent sequencing depth coverage at both on-target (left panel) and off-target (right panel) sites, achieving a coverage exceeding 3,000× for all on-target sites. Decrease in sequencing depth at the on-target sites in dual-gRNA libraries is marked by arrow, supporting predicted deletions (bottom left; i=35 bp, ii=205 bp, iii=205 bp).

FIG. 17C is a Table depicting the precise estimation of on-target mutation allele frequencies by capture sequencing. Notably, the observed rate of effective null mutation exceeds previous estimates by PCR validation of predictable deletions, as smaller InDels and inversions also occur at appreciable frequencies. FIG. 17D is a Table depicting the estimation of mutation frequencies at predicted off-target sites (*One off-target site was statistically different from controls following correction for multiple comparisons; p≤7.6×10⁻¹¹). N-fold enrichment is determined based on the ratio of non-reference reads in treated libraries compared to untreated library. Each value represents the average of all off-target sites for a given single gRNA or dual-gRNA experiment. Enrichment of 1 is equivalent to baseline (untreated control). **For reference to on-target enrichments, on-target combined represents the proportion of non-reference reads (including single and dual-gRNA treatments using a given gRNA) to total reads at on-target sites in treatment compared to control.

FIGS. 18A and 18B demonstrate potential off-target sites identified in CCR5 homologue CCR2 and analysis of events detected at the single off-target site in which mutagenesis was significantly detected above background. FIG. 18A depicts a sequence alignment of CCR5 gRNAs utilized in this study in relation to the closest homologous sequence in CCR2 showing mismatched nucleotides in bold. Noteworthy is the fact that guide crCCR5_B, which yielded the sole significantly detected off-target mutagenesis in CCR2 (detailed in panel B), has 3 nucleotide mismatches, which are distal to the PAM (underlined) and seed (grey box) sequences. FIG. 18B is a Table depicting in-depth analyses of all sequence reads at the single off-target site in which mutagenesis was significantly detected above background in both capture libraries treated with the associated gRNA (B; libraries treated with single gRNA crCCR5_B & dual-gRNA crCCR5_A+B), as well as the library treated with gRNA crCCR5_A as a comparison. Total off-target mutation frequency at this site was 0.6% in the single gRNA treatment (crCCR5_B) and notably decreased to 0.24% in the dual gRNA treatment (crCCR5_A+B) in which gRNA plasmid concentration of each gRNA was half of that utilized in single gRNA treatments.

FIGS. 19A and 19B demonstrate the generation of Fgm knockout mice by a CRISPR/Cas system employing a modified Cas9 mRNA. FIG. 19A is a schematic illustrating the steps employed to generate Fgm knockout mice using the CRISPR/Cas system employing the Cas9 modified RNA. FIG. 19B shows part of a gel picture depicting results from PCR screening of surviving pups for genetic mutations resulting from genomic editing using the CRISPR/Cas system and the modified Cas9 mRNA.

FIG. 20 shows predicted gRNA mapping in Ensembl GRCh37v71.

FIG. 21 shows guide pair crCCR5_A+B on-target alleles.

FIG. 22 shows guide pair crCCR5_C+D on-target alleles.

FIG. 23 shows guide pair crCCR5_D+Q on-target alleles.

FIG. 24 shows off-target sites with statistically significant mutational burden.

FIG. 25 shows a comparison of on- and off-target mutational burdens.

DETAILED DESCRIPTION OF THE INVENTION

Work described herein demonstrates methods of allele targeting using CRISPR/Cas systems resulting in mutant cells with efficiencies of up to 80%. These vastly improved methods permit CRISPR/Cas systems to be utilized effectively for the first time for therapeutic purposes. Methods of delivery of CRISPR/Cas systems to human stem cells are provided. In addition, methods of specifically identifying useful RNA guide sequences are provided, along with particular guide sequences useful in targeting specific genes (e.g., ADA, AK2, CD3D, DCLRE1C, HBB, IL2RG, IL7R, JAK3, LIG4, NHEJ1, PNP, PRKDC, RAG1, RAG2, and ZAP70). Moreover, methods of treatment (e.g., methods of treating severe combined immunodeficiency, sickle cell disease, and beta thalassemia) utilizing the compositions and methods disclosed herein are provided.

In one aspect, the present invention provides methods for altering target polynucleotide sequences in a cell.

In certain embodiments, the target polynucleotide sequence is a severe combined immunodeficiency (SCID)-associated polynucleotide sequence. In such embodiments, a method for altering a target polynucleotide sequence in a cell comprises a method for altering a target SCID-associated polynucleotide sequence. As used herein, “severe combined immunodeficiency-associated polynucleotide sequence” and “SCID-associated polynucleotide sequence” are used interchangeably to refer to a polynucleotide sequence of a gene displaying one or more mutations associated with SCID. As used herein “severe combined immunodeficiency” and “SCID” refer to a genetic disorder characterized by dysfunctional T-lymphocytes causing a defective antibody response due to either a direct involvement with B lymphocytes or aberrant B lymphocyte activation resulting from non-functional T-helper cells. SCID encompasses dysfunctional B and T cell responses of the adaptive immune system due to mutations in one or more genes, including, but not limited to, ADA, AK2, CD3D, DCLRE1C, IL2RG, IL7R, JAK3, LIG4, NHEJ1, PNP, PRKDC, RAG1, RAG2, and ZAP70. As such, a “SCID-associated polynucleotide sequence” encompasses nucleotide sequences of any of these genes along with variant or mutant forms thereof.

An exemplary method for altering a target severe combined immunodeficiency (SCID)-associated polynucleotide sequence in a cell comprises contacting the SCID-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, wherein the target SCID associated polynucleotide sequence is cleaved. In some embodiments, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%.

In some embodiments, the SCID-associated polynucleotide sequence is adenosine deaminase (ADA) or a variant thereof. An exemplary ADA sequence is a human ADA sequence (NCBI Gene ID: 100). Those skilled in the art will appreciate that the guide sequences shown in FIG. 1 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human ADA.

It should also be appreciated that altering a target polynucleotide sequence of ADA can be used to treat any abnormal phenotype associated with an altered ADA polynucleotide sequence. Table 1 below shows gene phenotype relationships identified by the Online Mendelian Inheritance in Man® (OMIM®) database. Further information regarding a phenotype listed in Table 1 is publicly accessible by querying OMIM for the search term “ADA” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 1 ADA Gene Phenotype Relationships Phenotype Location Phenotype MIM number 20q13.12 Adenosine deaminase deficiency, partial 102700 Severe combined immunodeficiency due 102700 to ADA deficiency

In some embodiments, the SCID-associated polynucleotide sequence is adenylate kinase 2 (AK2) or a variant thereof. An exemplary AK2 sequence is a human AK2 sequence (NCBI Gene ID: 204, also known as ADK2 and AK 2). In some embodiments, the human AK2 sequence comprises all or a portion of AK2 coding sequence 1. In some embodiments, the human AK2 sequence comprises all or a portion of AK2 coding sequence 2. In some embodiments, the human AK2 sequence comprises all or a portion of AK2 coding sequence 3. Those skilled in the art will appreciate that the guide sequences shown in FIGS. 2, 3 and 4 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human AK2, and in particular human AK2 coding sequences 1, 2 and 3, respectively.

It should also be appreciated that altering a target polynucleotide sequence of AK2 can be used to treat any abnormal phenotype associated with an altered AK2 polynucleotide sequence. Table 2 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 2 is publicly accessible by querying OMIM for the search term “AK2” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 2 AK2 Gene Phenotype Relationships Phenotype Location Phenotype MIM number 1p35.1 Reticular dysgenesis 267500

In some embodiments, the SCID-associated polynucleotide sequence is CD3 antigen, delta subunit (CD3D) or a variant thereof. An exemplary CD3D sequence is a human CD3D sequence (NCBI Gene ID: 915, also known as T3D and CD3-DELTA). In some embodiments, the human CD3D sequence comprises all or a portion of CD3D coding sequence 1. In some embodiments, the human CD3D sequence comprises all or a portion of CD3D coding sequence 2. Those skilled in the art will appreciate that the guide sequences shown in FIGS. 5 and 6 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human CD3D, and in particular human CD3D coding sequences 1 and 2, respectively.

It should also be appreciated that altering a target polynucleotide sequence of CD3D can be used to treat any abnormal phenotype associated with an altered CD3D polynucleotide sequence. Table 3 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 3 is publicly accessible by querying OMIM for the search term “CD3D” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 3 CD3D Gene Phenotype Relationships Phenotype Location Phenotype MIM number 11q23.3 Severe combined immunodeficiency, 608971 T cell-negative, B- cell/natural killer-cell positive

In some embodiments, the SCID-associated polynucleotide sequence is DNA cross-link repair protein 1C (DCLRE1C) or a variant thereof. An exemplary DCLRE1C sequence is a human DCLRE1C sequence (NCBI Gene ID: 64421, also known as SCIDA, SNM1C, A-SCID, hSNM1C, RS-SCID, DCLRE1C). In some embodiments, the human DCLRE1C sequence comprises all or a portion of DCLRE1C coding sequence 1. In some embodiments, the human DCLRE1C sequence comprises all or a portion of DCLRE1C coding sequence 2. In some embodiments, the human DCLRE1C sequence comprises all or a portion of DCLRE1C coding sequence 3. In some embodiments, the human DCLRE1C sequence comprises all or a portion of DCLRE1C coding sequence 4. Those skilled in the art will appreciate that the guide sequences shown in FIGS. 7, 8, 9, and 10 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human DCLRE1C, and in particular human DCLRE1C coding sequences 1, 2, 3, and 4, respectively.

It should also be appreciated that altering a target polynucleotide sequence of DCLRE1C can be used to treat any abnormal phenotype associated with an altered DCLRE1C polynucleotide sequence. Table 4 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 4 is publicly accessible by querying OMIM for the search term “DCLRE1C” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 4 DCLRE1C Gene Phenotype Relationships Phenotype Location Phenotype MIM number 10p13 Omenn syndrome 603554 Severe combined immunodeficiency, 602450 Athabascan type

In some embodiments, the SCID-associated polynucleotide sequence is interleukin 2 receptor, gamma (IL2RG) or a variant thereof. An exemplary IL2RG sequence is a human IL2RG sequence (NCBI Gene ID: 3561, also known as P64; CIDX; IMD4; CD132; SCIDX; IL-2RG; and SCIDX1). Those skilled in the art will appreciate that the guide sequences shown in FIG. 12 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human IL2RG.

It should also be appreciated that altering a target polynucleotide sequence of IL2RG can be used to treat any abnormal phenotype associated with an altered IL2RG polynucleotide sequence. Table 5 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 5 is publicly accessible by querying OMIM for the search term “IL2RG” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 5 IL2RG Gene Phenotype Relationships Phenotype Location Phenotype MIM number Xq13.1 Combined immunodeficiency, 312863 X-linked, moderate Severe combined immunodeficiency, 300400 X-linked

In some embodiments, the SCID-associated polynucleotide sequence is interleukin 7 receptor (IL7R) or a variant thereof. An exemplary IL7R sequence is a human IL7R sequence (NCBI Gene ID: 3575, also known as ILRA, CD127, IL7RA, CDW127, IL-7R-alpha). Those skilled in the art will appreciate that the guide sequences shown in FIG. 13 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human IL7R.

It should also be appreciated that altering a target polynucleotide sequence of IL7R can be used to treat any abnormal phenotype associated with an altered IL7R polynucleotide sequence. Table 6 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 6 is publicly accessible by querying OMIM for the search term “IL7R” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 6 IL7R Gene Phenotype Relationships Phenotype Location Phenotype MIM number 5p13.2 Severe combined immunodeficiency, 608971 T-cell negative, B-cell/natural killer cell-positive type

In some embodiments, the SCID-associated polynucleotide sequence is Janus kinase 3(JAK3) or a variant thereof. An exemplary JAK3 sequence is a human JAK3 sequence (NCBI Gene ID: 3718, also known as JAKL; LJAK; JAK-3; L-JAK; JAK3_HUMAN). Those skilled in the art will appreciate that the guide sequences shown in FIG. 14 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human JAK3.

It should also be appreciated that altering a target polynucleotide sequence of JAK3 can be used to treat any abnormal phenotype associated with an altered JAK3 polynucleotide sequence. Table 7 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 7 is publicly available by querying OMIM for the search term “JAK3” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 7 JAK3 Gene Phenotype Relationships Phenotype Location Phenotype MIM number 19p13.11 SCID, autosomal recessive, 600802 T-negative/B-positive type

In some embodiments, the SCID-associated polynucleotide sequence is ligase IV, DNA, ATP-dependent (LIG4) or a variant thereof. An exemplary LIG4 sequence is a human LIG4 sequence (NCBI Gene ID: 3981). In some embodiments, the human LIG4 sequence comprises all or a portion of LIG4 coding sequence 1. In some embodiments, the human LIG4 sequence comprises all or a portion of LIG4 coding sequence 2. In some embodiments, the human LIG4 sequence comprises all or a portion of LIG4 coding sequence 3. Those skilled in the art will appreciate that the guide sequences shown in FIGS. 15, 16, and 17 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human LIG4, and in particular human LIG4 coding sequences 1, 2, and 3, respectively.

It should also be appreciated that altering a target polynucleotide sequence of LIG4 can be used to treat any abnormal phenotype associated with an altered LIG4 polynucleotide sequence. Table 8 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 8 is publicly accessible by querying OMIM for the search term “LIG4” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 8 LIG4 Gene Phenotype Relationships Phenotype Location Phenotype MIM number 13q33.3 LIG4 syndrome 606593 Severe combined immunodeficiency with 602450 sensitivity to ionizing radiation {Multiple myeloma, resistance to} 254500

In some embodiments, the SCID-associated polynucleotide sequence is nonhomologous end-joining factor 1 (NHEJ1) or a variant thereof. An exemplary NHEJ1 sequence is the human NHEJ1 sequence (NCBI Gene ID: 79840, also known as XLF). Those skilled in the art will appreciate that the guide sequences shown in FIG. 18 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human NHEJ1.

It should also be appreciated that altering a target polynucleotide sequence of NHEJ1 can be used to treat any abnormal phenotype associated with an altered NHEJ1 polynucleotide sequence. Table 9 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 9 is publicly accessible by querying OMIM for the search term “NHEJ1” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 9 NHEJ1 Gene Phenotype Relationships Phenotype Location Phenotype MIM number 2q35 Severe combined immunodeficiency 611291 with microcephaly, growth retardation, and sensitivity to ionizing radiation

In some embodiments, the SCID-associated polynucleotide sequence is a purine nucleoside phosphorylase sequence (PNP) or a variant thereof. An exemplary PNP sequence is human PNP (NCBI Gene ID: 4860, also known as NP, PUNP, and PRO1837). Those skilled in the art will appreciate that the guide sequences shown in FIG. 19 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human PNP.

It should also be appreciated that altering a target polynucleotide sequence of PNP can be used to treat any abnormal phenotype associated with an altered PNP polynucleotide sequence. Table 10 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 10 is publicly accessible by querying OMIM for the search term “PNP” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 10 PNP Gene Phenotype Relationships Phenotype Location Phenotype MIM number 14q11.2 Immunodeficiency due to purine nucleoside 613179 phosphorylase deficiency

In some embodiments, the SCID-associated polynucleotide sequence is a protein kinase, DNA activated, catalytic polypeptide sequence (PRKDC) or a variant thereof. An exemplary PRKDC sequence is human PRKDC (NCBI Gene ID: 5591, also known as HYRC; p350; DNAPK; DNPK1; HYRC1; XRCC7; and DNA-PKcs). In some embodiments, the human PRKDC sequence comprises all or a portion of PRKDC coding sequence 1. In some embodiments, the human PRKDC sequence comprises all or a portion of PRKDC coding sequence 2. Those skilled in the art will appreciate that the guide sequences shown in FIGS. 20 and 21 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human PRKDC, and in particular human PRKDC coding sequences 1 and 2, respectively.

It should also be appreciated that altering a target polynucleotide sequence of PRKDC can be used to treat any abnormal phenotype associated with an altered PRKDC polynucleotide sequence. In some embodiments, the phenotype associated with an altered PRKDC polynucleotide sequence is SCID. In some embodiments, the phenotype associated with an altered PRKDC polynucleotide sequence is radiosensitivity in xeroderma pigmentosum (Abbaszadeh et al., A novel splice variant of the DNA-PKcs gene is associated with clinical and cellular radiosensitivity in a patient with xeroderma pigmentosum. J Med Genet. 2010; 47(3):176-81).

In some embodiments, the SCID-associated polynucleotide sequence is a recombination activating gene 1 sequence (RAG1) or a variant thereof. An exemplary RAG1 sequence is human RAG1 (NCBI Gene ID: 5896, also known as RAG-1 and RNF74). Those skilled in the art will appreciate that the guide sequences shown in FIG. 22 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human RAG1.

It should also be appreciated that altering a target polynucleotide sequence of RAG1 can be used to treat any abnormal phenotype associated with an altered RAG1 polynucleotide sequence. Table 11 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 11 is publicly accessible by querying OMIM for the search term “RAG1” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 11 RAG1 Gene Phenotype Relationships Phenotype Location Phenotype MIM number 11p12 Alpha/beta T-cell lymphopenia with 609889 gamma/delta T-cell expansion, severe cytomegalovirus infection, and autoimmunity Combined cellular and humoral 233650 immune defects with granulomas Omenn syndrome 603554 Severe combined immunodeficiency, 601457 B cell-negative

In some embodiments, the SCID-associated polynucleotide sequence is a recombination activating gene 2 sequence (RAG2) or a variant thereof. An exemplary RAG2 sequence is human RAG2 (NCBI Gene ID: 5897, also known as RAG-2). In some embodiments, the human RAG2 sequence comprises all or a portion of human RAG2 coding sequence 1. In some embodiments, the human RAG2 sequence comprises all or a portion of human RAG2 coding sequence 2. In some embodiments, the human RAG2 sequence comprises all or a portion of human RAG2 coding sequence 3. Those skilled in the art will appreciate that the guide sequences shown in FIGS. 23, 24 and 25 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human RAG2, and in particular human RAG2 coding sequences 1, 2 and 3, respectively.

It should also be appreciated that altering a target polynucleotide sequence of RAG2 can be used to treat any abnormal phenotype associated with an altered RAG2 polynucleotide sequence. Table 12 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 12 is publicly accessible by querying OMIM for the search term “RAG2” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 12 RAG2 Gene Phenotype Relationships Phenotype Location Phenotype MIM number 11p12 Combined cellular and humoral immune 233650 defects with granulomas Omenn syndrome 603554 Severe combined immunodeficiency, 601457 B cell-negative

In some embodiments, the SCID-associated polynucleotide sequence is a zeta-chain-associated protein kinase sequence (ZAP70) or a variant thereof. An exemplary ZAP70 sequence is human ZAP70 (NCBI Gene ID: 7535, also known as SRK; STD; TZK; STCD; and ZAP-70). In some embodiments, the human ZAP70 sequence comprises all or a portion of human ZAP70 coding sequence 1. In some embodiments, the human ZAP70 sequence comprises all or a portion of human ZAP70 coding sequence 2. Those skilled in the art will appreciate that the guide sequences shown in FIGS. 26 and 27 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human ZAP70, and in particular human ZAP70 coding sequences 1 and 2, respectively.

It should also be appreciated that altering a target polynucleotide sequence of ZAP70 can be used to treat any abnormal phenotype associated with an altered ZAP70 polynucleotide sequence. Table 13 below shows gene phenotype relationships identified by the OMIM® database. Further information regarding a phenotype listed in Table 13 is publicly accessible by querying OMIM for the search term “ZAP70” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 13 ZAP 70 Gene Phenotype Relationships Phenotype Location Phenotype MIM number 2q11.2 Selective T-cell 269840 defect

In some embodiments, the target polynucleotide sequence is hemoglobin beta (“HBB”) (e.g., human hemoglobin beta, NCBI Gene ID: 3043) or a variant thereof. Those skilled in the art will appreciate that the guide sequences shown in FIG. 11 can be used with the CRISPR/Cas systems of the present invention to alter target polynucleotide sequences of human HBB.

It should also be appreciated that altering a target polynucleotide sequence of HBB can be used to treat any abnormal phenotype associated with an altered HBB polynucleotide sequence. Table 14 below shows gene phenotype relationships as identified by the OMIM® database. Further information regarding a phenotype listed in Table 14 is publicly accessible by querying OMIM for the search term “HBB” and then clicking on a hyperlink in the “Phenotype MIM number” column corresponding to the phenotype listed.

TABLE 14 HBB Gene Phenotype Relationships Phenotype Location Phenotype MIM number 11p15.4 Delta-beta thalassemia 141749 Erythremias, beta- Heinz body anemias, beta- 140700 Hereditary persistence of fetal hemoglobin 141749 Methemoglobinemias, beta- Sickle cell anemia 603903 Thalassemia-beta, dominant inclusion-body 603902 Thalassemias, beta- 613985 {Malaria, resistance to} 611162

Normal adult hemoglobin is a tetramer that consists of two alpha chains and two beta chains. HBB determines the structure of the beta chains of hemoglobin. HBB mutations are associated with sickle cell diseases and/or beta thalassemia. For example, sickle cell anemia is caused by mutant beta globin. The absence of the beta chain results in beta-zero thalassemia. Diminished amounts of detectable beta globin results in beta-plus-thalassemia. Exemplary mutant forms of HBB involved in sickle cell disease are Hemoglobin S (Glu6Val), Hemoglobin C (Glu6Lys), Hemoglobin D (Glu121G1n), and Hemoglobin 0 (Glu121Lys).

Insertion of an L1 retrotransposable fragment within the IVS-II of the beta-globin gene results in Beta®-thal. This represents a form of beta thalassemia in which the beta globin gene expresses full length beta-globin transcripts at levels equal to about 15% of the total beta-globin mRNA.

In some embodiments, a method for altering a target polynucleotide sequence in a cell comprises a method for altering a target sickle cell disease (SCD)-associated polynucleotide sequence. As used herein, “sickle cell disease-associated polynucleotide sequence” or “SCD-associated polynucleotide sequence” are used interchangeably to refer to a polynucleotide sequence of the HBB gene displaying one or more HBB mutations associated with SCD. As used herein, “sickle cell disease” refers to a group of symptomatic disorders involving mutations in HBB and defined by the presence of hemoglobin S (Hb S). Normal hemoglobin is a heterotetramer consisting of two alpha-hemoglobin and two beta-hemoglobin chains. Point mutations in HBB cause hemoglobin S to result, for example a point mutation changing the sixth amino acid in the beta-hemoglobin chain from glutamic acid to valine (Glu6Val). Sickle cell anemia (homogzygous HbSS) is an example of a sickle cell disease which makes up between 60-70% of reported sickle cell disease in the United States. Examples of other forms of sickle cell disease are due to coinhereitance of Hb S with various mutant beta-globin chain variants, including sickle-hemoglobin C disease (Hb SC), and two different types of sickle beta thalassemia (Hb Sβ⁺_ thalassemia and Hb S β°_thalassemia).

An exemplary method for altering a target sickle cell disease (SCD)-associated polynucleotide sequence in a cell comprises contacting the SCD-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCD-associated polynucleotide sequence, wherein the target SCD-associated polynucleotide sequence is cleaved. In some embodiments of this and other aspects, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%.

In some embodiments, a method for altering a target polynucleotide sequence in a cell comprises a method for altering a target beta thalassemia-associated polynucleotide sequence. As used herein, “beta thalassemia-associated polynucleotide sequence” refers to a polynucleotide sequence of the HBB gene displaying one or more HBB mutations associated with beta thalassemia. As used herein, “beta thalassemia” refers to inherited autosomal recessive diseases characterized by decreased production of the hemoglobin subunit beta (e.g., hemoglobin beta chain) that are caused by over 200 different HBB mutations. As will be appreciated by the skilled artisan, HBB mutations resulting in beta thalassemia include non-deletional HBB mutants, deletional HBB mutants, and HBB mutants resulting from transposable elements. Table 15 below illustrates exemplary non-deletional HBB mutations.

TABLE 15 Non-deletional HBB mutations associated with beta thalassemia −101 (C−>T)  −92 (C−>T)  −90 (C−>T)  −88 (C−>A)  −88 (C−>T)  −87 (C−>A)  −87 (C−>G)  −87 (C−>T)  −86 (C−>A)  −86 (C−>G)  −32 (C−>A)  −31 (A−>C)  − 31 (A−>G)  −30 (T−>A)  −30 (T−>C)  −29 (A−>G)  −28 (A−>C)  −28 (A−>G) CAP +1 (A−>C) 5′UTR; +10 (−T) 5′UTR; +22 (G−>A) 5′UTR; +33 (C−>G) 5′UTR; +43 to +40 (−AAAC) Initiation codon ATG−>GTG Initiation codon ATG−>ACG Initiation codon ATG−>AGG Initiation codon ATG−>ATA Initiation codon ATG−>ATC Initiation codon ATG−>ATT Codon 1 (−G); GTG(Val)−>−TG Codons 2/3/4 (−9 bp; +31 bp); (see below) Codon 5 (−CT); CCT(Pro)−>C−− Codon 6 (−A); GAG(Glu)−>G−G Codon 8 (−AA); AAG(Lys)−>−−G Codons 8/9 (+G); AAG·TCT(Lys;Ser)−>AAG·G·TCT Codons 9/10 (+T); TCT·GCC(Ser;Ala)−>TCT·T·GCC Codon 10 (C−>A); GCC(Ala)−>GCA(Ala) Codon 11 (−T); GTT(Val)−>GT− Codons 14/15 (+G); Codon 15 (G−>A); TGG(Trp)−>TAG(stop codon) Codon 15 (G−>A); TGG(Trp)−>TGA(stop codon) Codon 15 (−T); TGG(Trp)−>−GG Codon 16 (−C); GGC(Gly)−>GG− Codon 17 (A−>T); AAG(Lys)−>TAG(stop codon) Codon 19 (A−>G); AAC(Asn)−>AGC(Ser) Codon 22 (A−>C); GAA(Glu)−>GCA(Ala) (not listed in TABLE I; this mutation is likely not associated with thalassemia) Codon 22 (G−>T); GAA(Glu)−>TAA(stop codon) Codons 22/23/24 (GAA·GTT·GGT; Glu·Val·Gly); deletion of −AAGTTGG Codon 24; GGT(Gly); (−G; +CAC) Codon 24 (T−>A); GGT(Gly)−>GGA(Gly) Codons 24/25 (−GGT); GGT·GGT(Gly-Gly)−>−−−·GGT(Gly) Codons 25/26 (+T); GGT·GAG(Gly-Glu)−>GGT·T·GAG(Gly-Term) Codon 26 (GAG−>AAG) Codon 26 (G−>T); GAG(Glu)−>TAG(stop codon) Codon 26 (+T); GAG(Glu)−>GTAG Codon 27 (G−>T); GCC(Ala)−>TCC(Ser) Codons 27/28 (+C); GCC·CTG(Ala·Ser)−>GCC·C·CTG Codon 28 (−C); CTG(Leu)−>−TG Codon 28 (T−>G); CTG(Leu)−>CGG(Arg) Codons 28/29 (−G); CTG·GGC(Leu·Gly)−>CTG·−GC IVS-I (−3) or codon 29 (C−>T); GGC(Gly)−>GGT(Gly) IVS-I (−2) or codon 30 (A−>G); AG{circumflex over ( )}GTTGGT−>GG{circumflex over ( )}GTTGGT IVS-I (−1) or codon 30 (G−>A); AG{circumflex over ( )}GTTGGT−>AA{circumflex over ( )}GTTGGT IVS-I (−1) or codon 30 (G−>C); AG{circumflex over ( )}GTTGGT−>AC{circumflex over ( )}GTTGGT IVS-I-1 (G−>A); AG{circumflex over ( )}GTTGGT−>AGATTGGT IVS-I-1 (G−>T); AG{circumflex over ( )}GTTGGT−>AGTTTGGT IVS-I-2 (T−>A); AG{circumflex over ( )}GTTGGT−>AGGATGGT IVS-I-2 (T−>C); AG{circumflex over ( )}GTTGGT−>AGACTGGT IVS-I-2 (T−>G); AG{circumflex over ( )}GTTGGT−>AGGGTGGT IVS-I-5 (G−>A) IVS-I-5 (G−>A) plus the 7,201 bp deletion involving part of the delta gene; the Corfu deletion (deltabeta-thal) IVS-I-5 (G−>C) IVS-I-5 (G−>T) IVS-I-6 (T−>C); the Portuguese type IVS-I-110 (G−>A) IVS-I-116 (T−>G) IVS-I-128 (T−>G); TTAG{circumflex over ( )}GCTG−>TGAG{circumflex over ( )}GCTG IVS-I-130 (G−>A); TTAG{circumflex over ( )}GCTG−>TTAA GCTG IVS-I-130 (G−>C); TTAG{circumflex over ( )}GCTG−>TTAC GCTG Codon 30 (AGG−>AGC) [IVS-I-130 (+1)] IVS-I, 3′ end; −17 bp Codon 31 (−C); CTG−>−TG Codons 31/32 (+CGG) Codon 32 (T−>A) CTG−>CAG; codon 98 (G−>A) GTG−>ATG Codons 33/34 (−GTG); GTG·GTC(Val-Val)−>GTC·−−−(Val) Codon 35 (C−>A); TAC−>TAA (Tyr−>Term codon) Codon 35 (−C); TAC(Tyr)−>TA− Codons 36/37 (−T); CCT·TGG(Pro-Trp)−>CCT·−GG Codon 37 (G−>A); TGG(Trp)−>TGA(stop codon) Codons 37/38/39 (−7 nts) Codons 38/39 (−C); ACC·CAG(Thr·Gln)−>ACC·−AG Codons 38/39 (−CC); ACC·CAG(Thr-Glu)−>A−−·CAG Codon 39 (C−>T); CAG(Gln)−>TAG(stop codon) Codon 40 (−G); AGG(Arg)−>AG− Codons 40/41 (+T); AGG·TTC(Arg-Phe)−>AGG·T·TTC Codon 41 (−C); TTC(Phe)−>TT− Codons 41/42 (−TTCT); TTC·TTT(Phe-Phe)−>−−− −TT Codons 42/43 (+G); TTT·GAG(Phe·Glu)−>TTT·G·GAG Codons 42/43 (+T) TTT·GAG(Phe·Glu)−>TTT·TGA·G(Phe;stop codon) Codon 43 (G−>T); GAG(Glu)−>TAG (stop codon) Codon 44 (−C); TCC(Ser)−>TC− Codon 45 (−T); TTT(Phe)−>−TT Codon 47 (+A); GAT(Asp)−>GAA(Glu)·T Codons 47/48 (+ATCT); GAT·CTG(Asp-Leu)−>GAT·CTATCTG Codon 51 (−C); CCT(Pro)−>−CT 53/54 (+G); GCT·GTT(Ala-Val)−>GCT·G·GTT Codon 54 (−T); GTT(Val)−>GT− Codons 54/55 (+A); GTT·ATG(Val·Met)−>GTT·A·ATG Codons 56/57/58/59/60 (GGC·AAC·CCT·AAG·GTG) (SEQ ID NO: 5345); duplication of 14 bp Codons 57/58 (+C); AAC·CCT(Asn·Pro)−>AAC·C·CCT Codon 59 (−A); AAG(Lys)−>−AG Codon 60 (T−>A); GTG(Val)−>GAG(Glu) Codon 61 (A−>T); AAG(Lys)−>TAG(stop codon) Codon 64 (−G); GGC(Gly)−>−GC Codon 67 (−TG); GTG(Val)−>−−G Codons 71/72 (+A); TTT·AGT(Phe·Ser)−>TTT·A·AGT Codons 71/72 (+T); TTT·AGT(Phe·Ser)−>TTT·T·AGT Codons 72/73; −AGTGA, +T; AGT·GAT(Ser-Asp)−>−−− −TT Codons 74/75 (−C); GGC·CTG(Gly·Leu)−>GG−·CTG Codon 76 (−C); GCT(Ala)−>G−T Codons 82/83 (−G); AAG·GGC(Lys·Gly)−>AAG·−GC Codons 84/85 (+C); ACC·TTT(Thr·Phe)−>ACC·C·TTT Codons 84/85/86 (+T); ACC·TTT·GCC(Thr·Phe·Ala)−>ACC·TTT·T·GCC Codon 88 (+T); CTG(Leu)−>CTTG Codons 89/90 (−GT); AGT·GAG(Ser·Glu)−>A−−·GAG Codon 90 (G−>T); GAG(Glu)−>TAG(stop codon) Codon 94 (+TG); GAC(Asp)−>GTGAC Codon 95 (+A); AAG(Lys)−>AAAG Codon 100; −CTT, +TCTGAGAACTT IVS-II-1 (G−>A); IVS-II-1 (G−>C); IVS-II-2,3 (+11, −2); insertion of 11 bp (5′-ACGTTCT CTGA-3′) (SEQ ID NO: 5346)_and deletion of GA (nts 2 and 3 of IVS-II) between positions 1 and 4 of IVS-II IVS-II-4,5 (−AG); IVS-II-5 (G−>C) IVS-II-654 (C−>T); AAGGCAATA−>AAG{circumflex over ( )}GTAATA IVS-II-705 (T−>G); GATGTAAGA−>GAG{circumflex over ( )}GTAAGA IVS-II-745 (C−>G); CAGCTACCAT−>CAG{circumflex over ( )}GTACCAT IVS-II-837 (T−>G); IVS-II-843 (T−>G); IVS-II-844 (C−>G); IVS-II-848 (C−>A); IVS-II-848 (C−>G); IVS-II-849 (A−>C); IVS-II-849 (A−>G); IVS-II-850 (−G); IVS-II-850 (G−>A); IVS-II-850 (G−>C); IVS-II-850 (G−>T); Codons 106/107 (+G); CTG·GGC(Leu·Gly)−>CTG·G·GC Codons 108/109/110/111/112 (−12 bp); Codon 109 (−G); GTG(Val)−>−TG Codon 110 (T−>C); CTG(Leu)−>CCG(Pro) Codon 112 (T−>A); TGT(Cys)−>TGA(stop codon) Codon 114 (T−>C); CTG(Leu)−>CCG(Pro) Codon 114 (−CT; +G); CTG(Leu)−>−GG Codon 115 (C−>A); GCC(Ala)−>GAC(Asp) Codons 120/121 (+A); AAA·GAA(Lys-Glu)−>AAA·A·GAA Codon 121 (G−>T); GAA(Glu)−>TAA(stop codon) Codon 123 (−A); ACC(Thr)−>−CC Codons 123/124/125 (−ACCCCACC); ACC·CCA·CCA(Thr·Pro·Pro)−>−−− −−− −−A Codon 124 (−A); CCA(Pro)−>CC− Codon 125 (−A); CCA(Pro)−>CC− Codons 124/125/126 (+CCA); CCA·CCA·GTG(Pro·Pro·Val) (SEQ ID NO: 5347) −>CCA·CCA·CCA·GTG (Pro·Pro·Pro·Val) (SEQ ID NO: 5348) Codon 126 (−T); GTG(Val)−>G−G Codon 126 (T−>G); GTG(Val)−>GGG(Gly) Codons 126−131 (Val-Gln-Ala-Ala-Thr-Gln (SEQ ID NO: 5349)) (−17 bp); GTG·GAG·GCT·GCC·TAT·CAG (SEQ ID NO: 5350) −>G Codon 127 (A−>C); CAG(Gln)−>CCG(Pro) Codon 127 (A−>G); CAG(Gln)−>CGG(Arg) Codon 127 (C−>T); CAG(Gln)−>TAG(stop codon) Codons 127/128 (−AGG); CAG-GCT(Gln·Ala)−>C−− −CT(Pro) Codons 128/129 (−4 bp, −GCTG; +5 bp, +CCACA) Codons 132−135 (−11 bp, −AAAGTGGTGGC) (SEQ ID NO: 5351) Codons 134/135/136/137 [−(G)TGGCTGGTGT(G) (SEQ ID NO: 5352) and +(G)GCAG(G)]; GTG·GCT·GGT·GTG (SEQ ID NO: 5353) (Val-Ala-Gly-Val) (SEQ ID NO: 5354) −>GGC·AGG(Gly-Arg) +1480 (C−>G); also known as 3′ terminating codon +6 (C−>G) 3′UTR (−GCATCTGGATTCT) (SEQ ID NO: 5355) 13 bp deletion between positions +1565 to +1577 (the numbers are relative to the Cap site) T−>C; 12 nts 5′ to the poly A site or +1570 (the number is relative to the Cap site) Poly A (T−>C); AATAAA−>AACAAA Poly A (A−>G); AATAAA−>AATGAA Poly A (A−>G); AATAAA−>AATAGA Poly A (A−>G); AATAAA−>AATAAG Poly A (−AT or −TA); AATAAA−>A−−AAA Poly A (−AATAA); AATAAA−−−−−>A

Those skilled in the art will also appreciate that a variety of deletional beta thalassemia alleles exist, which tend to be prevalent in certain at-risk populations. Examples of such deletional beta thalassemia alleles include, but are not limited to, a 25 bp deletion, a 44 bp deletion, a 105 bp deletion, a 290 bp deletion, a 532 bp deletion, a 619 bp deletion, a 1,393 bp deletion, a 1,605 bp deletion (“Croatian deletion”), a 3,485 bp deletion (“Thai deletion”), a 4,237 bp deletion (“Czech deletion”), a 7.6 kb deletion (“Turkish deletion”); a 10,329 bp deletion (“Asian Indian deletion”), a 12,023 bp deletion (“Australian deletion”); a 12,620 bp deletion (“Dutch deletion”), a 27 kb deletion (“Southeast Asian deletion”), a 45 kb deletion (“Filipino deletion”), and a 65 kb deletion (“Italian deletion”). Those skilled in the art will be able to retrieve the corresponding nucleic acid and protein sequences corresponding to these deletions from publicly available sources (e.g., A Syllabus of Thalassemia Mutations (1997) by Titus H. J. Huisman, et al. published by The Sickle Cell Anemia Foundation in Augusta, Ga., USA, available online at http://globin.cse.psu.edu/html/huisman/thals/I-b.entries.html).

An exemplary method for altering a target beta thalassemia-associated polynucleotide sequence in a cell comprises contacting the beta thalassemia-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target beta thalassemia-associated polynucleotide sequence, wherein the target beta thalassemia associated polynucleotide sequence is cleaved. In some embodiments of this and other aspects, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%.

As used herein, the term “contacting” (i.e., contacting a polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and/or ribonucleic acids) is intended to include incubating the Cas protein and/or the ribonucleic acids in the cell together in vitro (e.g., adding the Cas protein or nucleic acid encoding the Cas protein to cells in culture). In some embodiments, the term “contacting” is not intended to include the in vivo exposure of cells to the Cas protein and/or ribonucleic acids as disclosed herein that may occur naturally in a microorganism (i.e., bacteria). The step of contacting a target polynucleotide sequence with a Cas protein and/or ribonucleic acids as disclosed herein can be conducted in any suitable manner. For example, the cells may be treated in adherent culture, or in suspension culture. It is understood that the cells contacted with a Cas protein and/or ribonucleic acids as disclosed herein can also be simultaneously or subsequently contacted with another agent, such as a growth factor or other differentiation agent or environments to stabilize the cells, or to differentiate the cells further.

In another aspect, the present invention provides a method for treating or preventing a disorder associated with expression of a polynucleotide sequence in a subject.

The terms “treat”, “treating”, “treatment”, etc., as applied to an isolated cell, include subjecting the cell to any kind of process or condition or performing any kind of manipulation or procedure on the cell. As applied to a subject, the terms refer to providing medical or surgical attention, care, or management to an individual. The individual is usually ill or injured, or at increased risk of becoming ill relative to an average member of the population and in need of such attention, care, or management.

As used herein, the term “treating” and “treatment” refers to administering to a subject an effective amount of a composition so that the subject has a reduction in at least one symptom of the disease or an improvement in the disease, for example, beneficial or desired clinical results. For purposes of this invention, beneficial or desired clinical results include, but are not limited to, alleviation of one or more symptoms, diminishment of extent of disease, stabilized (i.e., not worsening) state of disease, delay or slowing of disease progression, amelioration or palliation of the disease state, and remission (whether partial or total), whether detectable or undetectable. Treating can refer to prolonging survival as compared to expected survival if not receiving treatment. Thus, one of skill in the art realizes that a treatment may improve the disease condition, but may not be a complete cure for the disease. As used herein, the term “treatment” includes prophylaxis. Alternatively, treatment is “effective” if the progression of a disease is reduced or halted. “Treatment” can also mean prolonging survival as compared to expected survival if not receiving treatment. Those in need of treatment include those already diagnosed with a disorder associated with expression of a polynucleotide sequence, as well as those likely to develop such a disorder due to genetic susceptibility or other factors.

By “treatment”, “prevention” or “amelioration” of a disease or disorder is meant delaying or preventing the onset of such a disease or disorder, reversing, alleviating, ameliorating, inhibiting, slowing down or stopping the progression, aggravation or deterioration the progression or severity of a condition associated with such a disease or disorder. In one embodiment, the symptoms of a disease or disorder are alleviated by at least 5%, at least 10%, at least 20%, at least 30%, at least 40%, or at least 50%.

In some embodiments, a method for treating or preventing a disorder associated with expression of a polynucleotide sequence comprises a method for treating or preventing a disorder associated with expression of a SCID-associated polynucleotide sequence.

An exemplary method for treating or preventing a disorder associated with expression of a SCID-associated polynucleotide sequence in a subject comprises (a) altering a target SCID-associated polynucleotide sequence in a cell ex vivo by contacting the SCID-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, wherein the target SCD-associated polynucleotide sequence is cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequence. In some embodiments of this and other aspects, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%.

An exemplary method for treating or preventing a disorder associated with expression of a SCID-associated polynucleotide sequence in a subject comprises altering a target SCID-associated polynucleotide sequence in a cell by contacting the SCID-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, and wherein the target SCID-associated polynucleotide sequence is cleaved, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequence.

In some embodiments, a method for treating or preventing a disorder associated with expression of a polynucleotide sequence comprises a method for treating or preventing a disorder associated with expression of a SCD-associated polynucleotide sequence.

An exemplary method for treating or preventing a disorder associated with expression of a SCD-associated polynucleotide sequence in a subject comprises (a) altering a target SCD-associated polynucleotide sequence in a cell ex vivo by contacting the SCD-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCD-associated polynucleotide sequence, wherein the target SCD-associated polynucleotide sequence is cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequence. In some embodiments of this and other aspects, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%.

An exemplary method for treating or preventing a disorder associated with expression of a SCD-associated polynucleotide sequence in a subject comprises altering a target SCD-associated polynucleotide sequence in a cell by contacting the SCD-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCD-associated polynucleotide sequence, and wherein the target SCD-associated polynucleotide sequence is cleaved, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequence.

In some embodiments, a method for treating or preventing a disorder associated with expression of a polynucleotide sequence comprises a method for treating or preventing a disorder associated with expression of a beta thalassemia-associated polynucleotide sequence.

An exemplary method for treating or preventing a disorder associated with expression of a beta thalassemia-associated polynucleotide sequence in a subject comprises (a) altering a target beta thalassemia-associated polynucleotide sequence in a cell ex vivo by contacting the beta thalassemia-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target beta thalassemia-associated polynucleotide sequence, wherein the target beta thalassemia-associated polynucleotide sequence is cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequence. In some embodiments of this and other aspects, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%.

An exemplary method for treating or preventing a disorder associated with expression of a beta thalassemia-associated polynucleotide sequence in a subject comprises altering a target beta thalassemia-associated polynucleotide sequence in a cell by contacting the beta thalassemia-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target beta thalassemia-associated polynucleotide sequence, and wherein the target beta thalassemia-associated polynucleotide sequence is cleaved, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequence.

The present invention contemplates altering target polynucleotide sequences in any manner which is available to the skilled artisan utilizing a CRISPR/Cas system of the present invention. Any CRISPR/Cas system that is capable of altering a target polynucleotide sequence in a cell can be used. Such CRISPR-Cas systems can employ a variety of Cas proteins (Haft et al. PLoS Comput Biol. 2005; 1(6)e60). The molecular machinery of such Cas proteins that allows the CRISPR/Cas system to alter target polynucleotide sequences in cells include RNA binding proteins, endo- and exo-nucleases, helicases, and polymerases. In some embodiments, the CRISPR/Cas system is a CRISPR type I system. In some embodiments, the CRISPR/Cas system is a CRISPR type II system.

The CRISPR/Cas systems of the present invention can be used to alter a target polynucleotide sequence in a cell. The present invention contemplates altering target polynucleotide sequences in a cell for any purpose. In some embodiments, the target polynucleotide sequence in a cell is altered to produce a mutant cell. As used herein, a “mutant cell” refers to a cell with a resulting genotype that differs from its original genotype. In some instances, a “mutant cell” exhibits a mutant phenotype, for example when a normally functioning gene is altered using the CRISPR/Cas systems of the present invention. In other instances, a “mutant cell” exhibits a wild-type phenotype, for example when a CRISPR/Cas system of the present invention is used to correct a mutant genotype. In exemplary embodiments, a mutant cell exhibits a wild-type ADA phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type AK2 phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type CD3D phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type DCLRE1C phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type IL2RG phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type IL7R phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type JAK3 phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type LIG4 phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type NHEJ1 phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type PNP phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type PRKDC phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type RAG1 phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type RAG2 phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type ZAP70 phenotype. In exemplary embodiments, a mutant cell exhibits a wild-type HBB phenotype. In some embodiments, the target polynucleotide sequence in a cell is altered to correct or repair a genetic mutation (e.g., to restore a normal phenotype to the cell). In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more ADA mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more AK2 mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more CD3D mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more DCRE1C mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more IL2RG mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more IL7R mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more JAK3 mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more LIG4 mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more NHEJ1 mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more PNP mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more PRKDC mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more RAG1 mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more RAG2 mutations involved in SCID. In an exemplary embodiment, a target SCID-associated polynucleotide sequence in a cell is altered to correct or repair one or more ZAP70 mutations involved in SCID. In an exemplary embodiment, a target SCD-associated polynucleotide sequence in a cell is altered to correct or repair one or more HBB mutations involved in SCD. In an exemplary embodiment, a target SCD-associated polynucleotide sequence in a cell is altered to correct or repair one or more HBB mutations involved in SCD. In another exemplary embodiment, a target beta thalassemia-associated polynucleotide sequence in a cell is altered to correct or repair one or more HBB mutations involved in beta thalassemia. In some embodiments, the target polynucleotide sequence in a cell is altered to induce a genetic mutation (e.g., to disrupt the function of a gene or genomic element).

In some embodiments, the alteration is an indel. As used herein, “indel” refers to a mutation resulting from an insertion, deletion, or a combination thereof. As will be appreciated by those skilled in the art, an indel in a coding region of a genomic sequence will result in a frameshift mutation, unless the length of the indel is a multiple of three. In some embodiments, the alteration is a point mutation. As used herein, “point mutation” refers to a substitution that replaces one of the nucleotides. A CRISPR/Cas system of the present invention can be used to induce an indel of any length or a point mutation in a target polynucleotide sequence.

In some embodiments, the alteration results in a knock out of the target polynucleotide sequence or a portion thereof. Knocking out a target polynucleotide sequence or a portion thereof using a CRISPR/Cas system of the present invention can be useful for a variety of applications. For example, knocking out a target polynucleotide sequence in a cell can be performed in vitro for research purposes. For ex vivo or in vivo purposes, knocking out a target polynucleotide sequence in a cell can be useful for treating or preventing a disorder associated with expression of the target polynucleotide sequence.

As used herein, “knock out” includes deleting all or a portion of the target polynucleotide sequence in a way that interferes with the function of the target polynucleotide sequence. For example, a knock out can be achieved by altering a target polynucleotide sequence by inducing an indel in the target polynucleotide sequence in a functional domain of the target polynucleotide sequence (e.g., a DNA binding domain). Those skilled in the art will readily appreciate how to use the CRISPR/Cas systems of the present invention to knock out a target polynucleotide sequence or a portion thereof based upon the details described herein.

In some embodiments, the alteration results in reduced expression of the target polynucleotide sequence. The terms “decrease,” “reduced,” “reduction,” and “decrease” are all used herein generally to mean a decrease by a statistically significant amount. However, for avoidance of doubt, decrease,” “reduced,” “reduction,” “decrease” means a decrease by at least 10% as compared to a reference level, for example a decrease by at least about 20%, or at least about 30%, or at least about 40%, or at least about 50%, or at least about 60%, or at least about 70%, or at least about 80%, or at least about 90% or up to and including a 100% decrease (i.e. absent level as compared to a reference sample), or any decrease between 10-100% as compared to a reference level.

In some embodiments, the alteration results in increased expression of the target polynucleotide sequence. The terms “increased”, “increase” or “enhance” or “activate” are all used herein to generally mean an increase by a statically significant amount; for the avoidance of any doubt, the terms “increased”, “increase” or “enhance” or “activate” means an increase of at least 10% as compared to a reference level, for example an increase of at least about 20%, or at least about 30%, or at least about 40%, or at least about 50%, or at least about 60%, or at least about 70%, or at least about 80%, or at least about 90% or up to and including a 100% increase or any increase between 10-100% as compared to a reference level, or at least about a 2-fold, or at least about a 3-fold, or at least about a 4-fold, or at least about a 5-fold or at least about a 10-fold increase, or any increase between 2-fold and 10-fold or greater as compared to a reference level.

The term “statistically significant” or “significantly” refers to statistical significance and generally means a two standard deviation (2SD) below normal, or lower, concentration of the marker. The term refers to statistical evidence that there is a difference. It is defined as the probability of making a decision to reject the null hypothesis when the null hypothesis is actually true. The decision is often made using the p-value.

In some embodiments, the alteration is a homozygous alteration. In some embodiments, the alteration is a heterozygous alteration.

In some embodiments, the alteration results in correction of the target polynucleotide sequence from an undesired sequence to a desired sequence. The CRISPR/Cas systems of the present invention can be used to correct any type of mutation or error in a target polynucleotide sequence. For example, the CRISPR/Cas systems of the present invention can be used to insert a nucleotide sequence that is missing from a target polynucleotide sequence due to a deletion. The CRISPR/Cas systems of the present invention can also be used to delete or excise a nucleotide sequence from a target polynucleotide sequence due to an insertion mutation. In some instances, the CRISPR/Cas systems of the present invention can be used to replace an incorrect nucleotide sequence with a correct nucleotide sequence (e.g., to restore function to a target polynucleotide sequence that is impaired due to a loss of function mutation, i.e., a SNP).

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant ADA polynucleotide sequences with wild-type ADA polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant AK2 polynucleotide sequences with wild-type AK2 polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant CD3D polynucleotide sequences with wild-type CD3D polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant DCLRE1C polynucleotide sequences with wild-type DCLRE1C polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant IL2RG polynucleotide sequences with wild-type IL2RG polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant IL7R polynucleotide sequences with wild-type IL7R polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant JAK3 polynucleotide sequences with wild-type JAK3 polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant LIG4 polynucleotide sequences with wild-type LIG4 polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant NHEJ1 polynucleotide sequences with wild-type NHEJ1 polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant PNP polynucleotide sequences with wild-type PNP polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant PRKDC polynucleotide sequences with wild-type PRKDC polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant RAG1 polynucleotide sequences with wild-type RAG1 polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant RAG2 polynucleotide sequences with wild-type RAG2 polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant ZAP70 polynucleotide sequences with wild-type ZAP70 polynucleotide sequences.

In exemplary embodiments, the CRISPR/Cas systems of the present invention can be used to replace mutant HBB polynucleotide sequences with wild-type HBB polynucleotide sequences.

The CRISPR/Cas systems of the present invention can alter target polynucleotides with surprisingly high efficiency as compared to conventional CRISPR/Cas systems. In certain embodiments, the efficiency of alteration is at least about 5%. In certain embodiments, the efficiency of alteration is at least about 10%. In certain embodiments, the efficiency of alteration is from about 10% to about 80%. In certain embodiments, the efficiency of alteration is from about 30% to about 80%. In certain embodiments, the efficiency of alteration is from about 50% to about 80%. In some embodiments, the efficiency of alteration is greater than or equal to about 80%.

The CRISPR/Cas systems of the present invention can be used to alter any target polynucleotide sequence in a cell. Those skilled in the art will readily appreciate that desirable target polynucleotide sequences to be altered in any particular cell may correspond to any genomic sequence for which expression of the genomic sequence is associated with a disorder or otherwise facilitates entry of a pathogen into the cell. For example, a desirable target polynucleotide sequence to alter in a cell may be a polynucleotide sequence corresponding to a genomic sequence which contains a disease associated single polynucleotide polymorphism (e.g., sickle cell disease, e.g., sickle cell anemia). In such example, the CRISPR/Cas systems of the present invention can be used to correct the disease associated SNP by replacing it with a wild-type allele (e.g., replacing a Glu6Val SNP in hemoglobin S to Val6Glu, a Glu6Lys SNP in hemoglobin C to Lys6Glu, a Glu121Gln SNP in hemoglobin D to Gln121Glu, a Glu121Lys SNP in hemoglobin 0 to Lys121Glu). As another example, a polynucleotide sequence of a target gene which is responsible for entry or proliferation of a pathogen into a cell may be a suitable target for deletion or insertion to disrupt the function of the target gene to prevent the pathogen from entering the cell or proliferating inside the cell.

In some embodiments, the target polynucleotide sequence is a genomic sequence. In some embodiments, the target polynucleotide sequence is a human genomic sequence. In some embodiments, the target polynucleotide sequence is a mammalian genomic sequence. In some embodiments, the target polynucleotide sequence is a vertebrate genomic sequence. In some embodiments, the target sequence is a mutant or variant genomic sequence (e.g., a ADA mutant, a AK2 mutant, a CD3D mutant, a DCLRE1C mutant, a IL2RG mutant, a IL7R mutant, a JAK3 mutant, a LIG4 mutant, a NHEJ1 mutant, a PNP mutant, a PRKDC mutant, a RAG1 mutant, a RAG2 mutant, a ZAP70 mutant, a HBB mutant). In some embodiments, the target polynucleotide sequence is a human genomic mutant or variant sequence (e.g., ADA, AK2, CD3D, DCLRE1C, IL2RG, IL7R, JAK3, LIG4, NHEJ1, PNP, PRKDC, RAG1, RAG2, ZAP70, HBB). In some embodiments, the target polynucleotide sequence is a mutant or variant mammalian genomic sequence. In some embodiments, the target polynucleotide sequence is a mammalian mutant or variant genomic sequence.

In some embodiments, a target polynucleotide sequence is a pathogenic genomic sequence. Exemplary pathogenic genomic sequences include, but are not limited to a viral genomic sequence, a bacterial genomic sequence, a fungal genomic sequence, a toxin genomic sequence, or a parasitic genomic sequence. In such embodiments, the CRISPR/Cas systems of the present invention can be used to disrupt the function of a pathogen (e.g., to treat or prevent an infection by the pathogen) by cleaving a genomic sequence of the pathogen (e.g., a genomic sequence that is critical for entry into a cell, or responsible for multiplication, growth or survival once the pathogen is inside a cell).

In some embodiments, the target polynucleotide sequence is an SCID-associated polynucleotide sequence.

In some embodiments, the target polynucleotide sequence is ADA or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of ADA or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of ADA or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of ADA or a portion thereof.

In some embodiments, the target polynucleotide sequence is AK2 or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of AK2 or a portion thereof. In some embodiments, the target polynucleotide sequence is AK2 coding sequence 1 or a portion thereof. In some embodiments, the target polynucleotide sequence is AK2 coding sequence 2 or a portion thereof. In some embodiments, the target polynucleotide sequence is AK2 coding sequence 3 or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of AK2 or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of AK2 or a portion thereof.

In some embodiments, the target polynucleotide sequence is CD3D or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of CD3D or a portion thereof. In some embodiments, the target polynucleotide sequence is CD3D coding sequence 1 or a portion thereof. In some embodiments, the target polynucleotide sequence is CD3D coding sequence 2 or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of CD3D or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of CD3D or a portion thereof.

In some embodiments, the target polynucleotide sequence is DCLRE1C or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of DCLRE1C or a portion thereof. In some embodiments, the target polynucleotide sequence is DCLRE1C coding sequence 1 or a portion thereof. In some embodiments, the target polynucleotide sequence is DCLRE1C coding sequence 1 or a portion thereof. In some embodiments, the target polynucleotide sequence is DCLRE1C coding sequence 2 or a portion thereof. In some embodiments, the target polynucleotide sequence is DCLRE1C coding sequence 3 or a portion thereof. In some embodiments, the target polynucleotide sequence is DCLRE1C coding sequence 4 or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of DCLRE1C or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of DCLRE1C or a portion thereof.

In some embodiments, the target polynucleotide sequence is IL2RG or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of IL2RG or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of IL2RG or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of IL2RG or a portion thereof.

In some embodiments, the target polynucleotide sequence is IL7R or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of IL7R or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of IL7R or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of IL7R or a portion thereof.

In some embodiments, the target polynucleotide sequence is JAK3 or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of JAK3 or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of JAK3 or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of JAK3 or a portion thereof.

In some embodiments, the target polynucleotide sequence is LIG4 or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of LIG4 or a portion thereof. In some embodiments, the target polynucleotide sequence is LIG4 coding sequence 1 or a portion thereof. In some embodiments, the target polynucleotide sequence is LIG4 coding sequence 2 or a portion thereof. In some embodiments, the target polynucleotide sequence is LIG4 coding sequence 3 or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of LIG4 or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of LIG4 or a portion thereof.

In some embodiments, the target polynucleotide sequence is NHEJ1 or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of NHEJ1 or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of NHEJ1 or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of NHEJ1 or a portion thereof.

In some embodiments, the target polynucleotide sequence is PNP or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of PNP or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of PNP or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of PNP or a portion thereof.

In some embodiments, the target polynucleotide sequence is PRKDC or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of PRKDC or a portion thereof. In some embodiments, the target polynucleotide sequence is PRKDC coding sequence 1 or a portion thereof. In some embodiments, the target polynucleotide sequence is PRKDC coding sequence 2 or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of PRKDC or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of PRKDC or a portion thereof.

In some embodiments, the target polynucleotide sequence is RAG1 or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of RAG1 or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of RAG1 or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of RAG1 or a portion thereof.

In some embodiments, the target polynucleotide sequence is RAG2 or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of RAG2 or a portion thereof. In some embodiments, the target polynucleotide sequence is RAG2 coding sequence 1 or a portion thereof. In some embodiments, the target polynucleotide sequence is RAG2 coding sequence 2 or a portion thereof. In some embodiments, the target polynucleotide sequence is RAG2 coding sequence 3 or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of RAG2 or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of RAG2 or a portion thereof.

In some embodiments, the target polynucleotide sequence is ZAP70 or a portion thereof. In some embodiments, the target polynucleotide sequence is a variant of ZAP70 or a portion thereof. In some embodiments, the target polynucleotide sequence is ZAP70 coding sequence 1 or a portion thereof. In some embodiments, the target polynucleotide sequence is ZAP70 coding sequence 2 or a portion thereof. In some embodiments, the target polynucleotide sequence is a homolog of ZAP70 or a portion thereof. In some embodiments, the target polynucleotide sequence is an ortholog of ZAP70 or a portion thereof.

In some embodiments, the target polynucleotide sequence is a SCD-associated polynucleotide sequence (e.g., a mutant form of HBB; NCBI Gene ID: 3043) or a portion thereof. In some embodiments, the target polynucleotide sequence is a mutant homolog of a SCD-associated polynucleotide sequence (e.g., a mutated homolog of HBB) or a portion thereof. In some embodiments, the target polynucleotide sequence is a mutant ortholog of a SCD-associated polynucleotide sequence (e.g., a mutated ortholog of HBB) or a portion thereof.

In some embodiments, the target polynucleotide sequence is a beta thalassemia-associated polynucleotide sequence (e.g., a mutant form of HBB).

In some embodiments, the target polynucleotide sequence is a mutant homolog of a beta thalassemia-associated polynucleotide sequence (e.g., a mutated homolog of HBB) or a portion thereof. In some embodiments, the target polynucleotide sequence is a mutant ortholog of a beta thalassemia-associated polynucleotide sequence (e.g., a mutated ortholog of HBB) or a portion thereof. The relevant portions of these target polynucleotide sequences correspond to the guide sequences shown in FIGS. 1, 2-4, 5-6, 7-10, 12, 13, 14, 15-17, 18, 19, 20-21, 22, 23-25, 26-27, and 11, respectively.

It should be appreciated that the CRISPR/Cas systems of the present invention can cleave target polynucleotide sequences in a variety of ways. In some embodiments, the target polynucleotide sequence is cleaved such that a double-strand break results. In some embodiments, the target polynucleotide sequence is cleaved such that a single-strand break results.

The methods of the present invention can be used to alter any target polynucleotide sequence in a cell, as long as the target polynucleotide sequence in the cell contains a suitable target motif that allows at least one ribonucleic acid of the CRISPR/Cas system to direct the Cas protein to and hybridize to the target motif. Those skilled in the art will appreciate that the target motif for targeting a particular polynucleotide depends on the CRISPR/Cas system being used, and the sequence of the polynucleotide to be targeted.

In some embodiments, the target motif is at least 20 bp in length. In some embodiments, the target motif is a 20-nucleotide DNA sequence. In some embodiments, the target motif is a 20-nucleotide DNA sequence beginning with G and immediately precedes an NGG motif recognized by the Cas protein. In some embodiments, the target motif is G(N)₁₉NGG. In some embodiments, the target motif is a 20-nucleotide DNA sequence and immediately precedes an NGG motif recognized by the Cas protein. In some embodiments, the target motif is (N)₂₀NGG. It is to be understood that the type of target motif for each of the ADA, AK2, CD3D, DCLRE1C, HBB, IL2RG, IL7R, JAK3, LIG4, NHEJ1, PNP, PRKDC, RAG1, RAG2, and ZAP70 target polynucleotide sequences can be found in the “site_type” column of FIGS. 1, 2-4, 5-6, 7-10, 11, 12, 13, 14, 15-17, 18, 19, 20-21, 22, 23-25, and 26-27, respectively.

The target motifs of the present invention can be selected to minimize off-target effects of the CRISPR/Cas systems of the present invention. In some embodiments, the target motif is selected such that it contains at least two mismatches when compared with all other genomic nucleotide sequences in the cell. In some embodiments, the target motif is selected such that it contains at least one mismatch when compared with all other genomic nucleotide sequences in the cell. Those skilled in the art will appreciate that a variety of techniques can be used to select suitable target motifs for minimizing off-target effects (e.g., bioinformatics analyses).

In some embodiments, the CRISPR/Cas systems of the present invention utilize homology-directed repair to correct target polynucleotide sequences. In some embodiments, subsequent to cleavage of the target polynucleotide sequence, homology-directed repair occurs. In some embodiments, homology-directed repair is performed using an exogenously introduced DNA repair template. The exogenously introduced DNA repair template can be single-stranded or double-stranded. The DNA repair template can be of any length. Those skilled in the art will appreciate that the length of any particular DNA repair template will depend on the target polynucleotide sequence that is to be corrected. The DNA repair template can be designed to repair or replace any target polynucleotide sequence, particularly target polynucleotide sequences comprising disease associated polymorphisms (e.g., SNPs). For example, homology-directed repair of a mutant allele comprising such SNPs can be achieved with a CRISPR/Cas system by selecting two target motifs which flank the mutant allele, and an designing a DNA repair template to match the wild-type allele.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence is corrected by homology-directed repair utilizing a corresponding normal wild-type gene sequence as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant ADA) is corrected by homology-directed repair utilizing a normal wild-type ADA sequence or portions thereof as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant AK2) is corrected by homology-directed repair utilizing a normal wild-type AK2 sequence (e.g., wild-type AK2 coding sequence 1, AK2 coding sequence 2, and AK2 coding sequence 3 or portions thereof) as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant CD3D) is corrected by homology-directed repair utilizing a normal wild-type CD3D sequence (e.g., wild-type CD3D coding sequence 1, and CD3D coding sequence 2 or portions thereof) as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant DCLRE1C) is corrected by homology-directed repair utilizing a normal wild-type DCLRE1C sequence (e.g., wild-type DCLRE1C coding sequence 1, DCLRE1C coding sequence 2, DCLRE1C coding sequence 3, and DCLRE1C coding sequence 4 or portions thereof) as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant IL2RG) is corrected by homology-directed repair utilizing a normal wild-type IL2RG sequence or portions thereof as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant IL7R) is corrected by homology-directed repair utilizing a normal wild-type IL7R sequence or portions thereof as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant JAK3) is corrected by homology-directed repair utilizing a normal wild-type JAK3 sequence or portions thereof as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant LIG4) is corrected by homology-directed repair utilizing a normal wild-type LIG4 sequence (e.g., wild-type LIG4 coding sequence 1, LIG4 coding sequence 2, LIG4 coding sequence 3, or portions thereof) as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant NHEJ1) is corrected by homology-directed repair utilizing a normal wild-type NHEJ1 sequence or portions thereof as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant PNP) is corrected by homology-directed repair utilizing a normal wild-type PNP sequence or portions thereof as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant PRKDC) is corrected by homology-directed repair utilizing a normal wild-type PRKDC sequence (e.g., wild-type PRKDC coding sequence 1, PRKDC coding sequence 2, or portions thereof) as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant RAG1) is corrected by homology-directed repair utilizing a normal wild-type RAG1 sequence or portions thereof as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant RAG2) is corrected by homology-directed repair utilizing a normal wild-type RAG2 sequence (e.g., wild-type RAG2 coding sequence 1, RAG2 coding sequence 2, RAG2 coding sequence 3, or portions thereof) as a DNA repair template.

In an exemplary embodiment, a cleaved target SCID-associated polynucleotide associated sequence (i.e., mutant ZAP70) is corrected by homology-directed repair utilizing a normal wild-type ZAP70 sequence (e.g., wild-type ZAP70 coding sequence 1, ZAP70 coding sequence 2, or portions thereof) as a DNA repair template.

In an exemplary embodiment, a cleaved target SCD-associated polynucleotide associated sequence is corrected by homology-directed repair utilizing a normal wild-type HBB sequence or portions thereof as a DNA repair template.

In an exemplary embodiment, a cleaved target beta thalassemia-associated polynucleotide associated sequence is corrected by homology-directed repair utilizing a normal wild-type HBB sequence or portions thereof as a DNA repair template.

In some embodiments, a CRISPR/Cas system of the present invention includes a Cas protein and at least one to two one ribonucleic acids that are capable of directing the Cas protein to and hybridizing to a target motif of a target polynucleotide sequence.

As used herein, “protein” and “polypeptide” are used interchangeably to refer to a series of amino acid residues joined by peptide bonds (i.e., a polymer of amino acids) and include modified amino acids (e.g., phosphorylated, glycated, glycosolated, etc.) and amino acid analogs. Exemplary polypeptides or proteins include gene products, naturally occurring proteins, homologs, paralogs, fragments and other equivalents, variants, and analogs of the above.

In some embodiments, a Cas protein comprises one or more amino acid substitutions or modifications. In some embodiments, the one or more amino acid substitutions comprises a conservative amino acid substitution. In some instances, substitutions and/or modifications can prevent or reduce proteolytic degradation and/or extend the half-life of the polypeptide in a cell. In some embodiments, the Cas protein can comprise a peptide bond replacement (e.g., urea, thiourea, carbamate, sulfonyl urea, etc.). In some embodiments, the Cas protein can comprise a naturally occurring amino acid. In some embodiments, the Cas protein can comprise an alternative amino acid (e.g., D-amino acids, beta-amino acids, homocysteine, phosphoserine, etc.). In some embodiments, a Cas protein can comprise a modification to include a moiety (e.g., PEGylation, glycosylation, lipidation, acetylation, end-capping, etc.).

In some embodiments, a Cas protein comprises a core Cas protein. Exemplary Cas core proteins include, but are not limited to Cas1, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8 and Cas9. In some embodiments, a Cas protein comprises a Cas protein of an E. coli subtype (also known as CASS2). Exemplary Cas proteins of the E. Coli subtype include, but are not limited to Cse1, Cse2, Cse3, Cse4, and Cas5e. In some embodiments, a Cas protein comprises a Cas protein of the Ypest subtype (also known as CASS3). Exemplary Cas proteins of the Ypest subtype include, but are not limited to Csy1, Csy2, Csy3, and Csy4. In some embodiments, a Cas protein comprises a Cas protein of the Nmeni subtype (also known as CASS4). Exemplary Cas proteins of the Nmeni subtype include, but are not limited to Csn1 and Csn2. In some embodiments, a Cas protein comprises a Cas protein of the Dvulg subtype (also known as CASS1). Exemplary Cas proteins of the Dvulg subtype include Csd1, Csd2, and Cas5d. In some embodiments, a Cas protein comprises a Cas protein of the Tneap subtype (also known as CASS7). Exemplary Cas proteins of the Tneap subtype include, but are not limited to, Cst1, Cst2, Cas5t. In some embodiments, a Cas protein comprises a Cas protein of the Hmari subtype. Exemplary Cas proteins of the Hmari subtype include, but are not limited to Csh1, Csh2, and Cas5h. In some embodiments, a Cas protein comprises a Cas protein of the Apern subtype (also known as CASS5). Exemplary Cas proteins of the Apern subtype include, but are not limited to Csa1, Csa2, Csa3, Csa4, Csa5, and Cas5a. In some embodiments, a Cas protein comprises a Cas protein of the Mtube subtype (also known as CASS6). Exemplary Cas proteins of the Mtube subtype include, but are not limited to Csm1, Csm2, Csm3, Csm4, and Csm5. In some embodiments, a Cas protein comprises a RAMP module Cas protein. Exemplary RAMP module Cas proteins include, but are not limited to, Cmr1, Cmr2, Cmr3, Cmr4, Cmr5, and Cmr6.

In some embodiments, the Cas protein is a Streptococcus pyogenes Cas9 protein or a functional portion thereof. In some embodiments, the Cas protein is Cas9 protein from any bacterial species or functional portion thereof. Cas9 protein is a member of the type II CRISPR systems which typically include a trans-coded small RNA (tracrRNA), endogenous ribonuclease 3 (mc) and a Cas protein. Cas9 protein (also known as CRISPR-associated endonuclease Cas9/Csn1) is a polypeptide comprising 1368 amino acids. An exemplary amino acid sequence of a Cas9 protein (SEQ ID NO: 298) is shown in FIG. 28. Cas 9 contains 2 enconuclease domains, including an RuvC-like domain (residues 7-22, 759-766 and 982-989) which cleaves target DNA that is noncomplementary to crRNA, and an HNH nuclease domain (residues 810-872) which cleave target DNA complementary to crRNA. In FIG. 28, the RuvC-like domain is highlighted in yellow and the HNH nuclease domain is underlined.

As used herein, “functional portion” refers to a portion of a peptide which retains its ability to complex with at least one ribonucleic acid (e.g., guide RNA (gRNA)) and cleave a target polynucleotide sequence. In some embodiments, the functional portion comprises a combination of operably linked Cas9 protein functional domains selected from the group consisting of a DNA binding domain, at least one RNA binding domain, a helicase domain, and an endonuclease domain. In some embodiments, the functional domains form a complex.

In some embodiments, a functional portion of the Cas9 protein comprises a functional portion of a RuvC-like domain. In some embodiments, a functional portion of the Cas9 protein comprises a functional portion of the HNH nuclease domain.

It should be appreciated that the present invention contemplates various of ways of contacting a target polynucleotide sequence with a Cas protein (e.g., Cas9). In some embodiments, exogenous Cas protein can be introduced into the cell in polypeptide form. In certain embodiments, Cas proteins can be conjugated to or fused to a cell-penetrating polypeptide or cell-penetrating peptide. As used herein, “cell-penetrating polypeptide” and “cell-penetrating peptide” refers to a polypeptide or peptide, respectively, which facilitates the uptake of molecule into a cell. The cell-penetrating polypeptides can contain a detectable label.

In certain embodiments, Cas proteins can be conjugated to or fused to a charged protein (e.g., that carries a positive, negative or overall neutral electric charge). Such linkage may be covalent. In some embodiments, the Cas protein can be fused to a superpositively charged GFP to significantly increase the ability of the Cas protein to penetrate a cell (Cronican et al. ACS Chem Biol. 2010; 5(8):747-52).

In certain embodiments, the Cas protein can be fused to a protein transduction domain (PTD) to facilitate its entry into a cell. Exemplary PTDs include Tat, oligoarginine, and penetratin.

In some embodiments, the Cas9 protein comprises a Cas9 polypeptide fused to a cell-penetrating peptide. In some embodiments, the Cas9 protein comprises a Cas9 polypeptide fused to a PTD. In some embodiments, the Cas9 protein comprises a Cas9 polypeptide fused to a tat domain. In some embodiments, the Cas9 protein comprises a Cas9 polypeptide fused to an oligoarginine domain. In some embodiments, the Cas9 protein comprises a Cas9 polypeptide fused to a penetratin domain. In some embodiments, the Cas9 protein comprises a Cas9 polypeptide fused to a superpositively charged GFP.

In some embodiments, the Cas protein can be introduced into a cell containing the target polynucleotide sequence in the form of a nucleic acid encoding the Cas protein (e.g., Cas9). The process of introducing the nucleic acids into cells can be achieved by any suitable technique. Suitable techniques include calcium phosphate or lipid-mediated transfection, electroporation, and transduction or infection using a viral vector. In some embodiments, the nucleic acid comprises DNA. In some embodiments, the nucleic acid comprises a modified DNA, as described herein. In some embodiments, the nucleic acid comprises mRNA. In some embodiments, the nucleic acid comprises a modified mRNA, as described herein (e.g., a synthetic, modified mRNA).

In some embodiments, the Cas protein is complexed with the one to two ribonucleic acids. In some embodiments, the Cas protein and the one to two ribonucleic acids are contained in a nanoparticle. In some embodiments, the Cas protein and the one to two ribonucleic acids are contained in a lipid nanoparticle, as described herein. In some embodiments, the Cas protein is encoded by a modified nucleic acid, as described herein (e.g., a synthetic, modified mRNA).

The methods of the present invention contemplate the use of any ribonucleic acid that is capable of directing a Cas protein to and hybridizing to a target motif of a target polynucleotide sequence. In some embodiments, at least one of the ribonucleic acids comprises tracrRNA. In some embodiments, at least one of the ribonucleic acids comprises CRISPR RNA (crRNA). In some embodiments, at least one of the ribonucleic acids comprises a guide RNA that directs the Cas protein to and hybridizes to a target motif of the target polynucleotide sequence in a cell.

The ribonucleic acids of the present invention can be selected to hybridize to a variety of different target motifs, depending on the particular CRISPR/Cas system employed, and the sequence of the target polynucleotide, as will be appreciated by those skilled in the art. The one to two ribonucleic acids can also be selected to minimize hybridization with nucleic acid sequences other than the target polynucleotide sequence. In some embodiments, the one to two ribonucleic acids hybridize to a target motif that contains at least two mismatches when compared with all other genomic nucleotide sequences in the cell. In some embodiments, the one to two ribonucleic acids hybridize to a target motif that contains at least one mismatch when compared with all other genomic nucleotide sequences in the cell. In some embodiments, the one to two ribonucleic acids are designed to hybridize to a target motif immediately adjacent to a deoxyribonucleic acid motif recognized by the Cas protein. In some embodiments, each of the one to two ribonucleic acids are designed to hybridize to target motifs immediately adjacent to deoxyribonucleic acid motifs recognized by the Cas protein which flank a mutant allele located between the target motifs.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequence of GTAACGGCAGACTTCTCCAC (SEQ ID NO: 5322). In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). It should be appreciated that the former sequence is the protospacer sequence in the guide RNA, whereas the latter sequence is the protospacer plus the PAM.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14.

In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15. In some embodiments, at least one of the one to two ribonucleic acids comprises a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15.

In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above do not include the 3 nucleotide NGG sequence. For example, if the target site sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one of the one to two ribonucleic acid sequences is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324). As another example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5325), a ribonucleic acid sequence with a single nucleotide mismatch which does not include the 3 nucleotide NGG sequence is GATGCTGAGTACAGCCACCT (SEQ ID NO: 5326), with the italicized G being the mismatched nucleotide. Those skilled in the art will appreciate, however, that the single nucleotide mismatch can comprise any nucleotide in the ribonucleic acid, e.g., the first nucleotide, the second nucleotide, the third nucleotide, the fourth nucleotide, the fifth nucleotide, the sixth nucleotide, the seventh nucleotide, the eighth nucleotide, the ninth nucleotide, the tenth nucleotide, the eleventh nucleotide, the twelfth nucleotide, the thirteenth nucleotide, the fourteenth nucleotide, the fifteenth nucleotide, the sixteenth nucleotide, the seventeenth nucleotide, the eighteenth nucleotide, the nineteenth nucleotide, or the twentieth nucleotide of the ribonucleic acid.

In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 12 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 12 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one of the one to two ribonucleic acids which comprises at least a 12 nucleotide fragment is GTACAGCCACCT (SEQ ID NO: 5327).

In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 13 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 13 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one of the one to two ribonucleic acids which comprises at least a 13 nucleotide fragment is AGTACAGCCACCT (SEQ ID NO: 5328).

In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 14 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 14 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one of the one to two ribonucleic acids which comprises at least a 14 nucleotide fragment is CAGTACAGCCACCT (SEQ ID NO: 5329).

In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 15 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 15 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one of the one to two ribonucleic acids which comprises at least a 15 nucleotide fragment is TCAGTACAGCCACCT (SEQ ID NO: 5330).

In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one of the one to two ribonucleic acids which comprises at least a 16 nucleotide fragment is CTCAGTACAGCCACCT (SEQ ID NO: 5331).

In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 17 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 17 nucleotide fragment of a ribonucleic acid sequence of any of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one of the one to two ribonucleic acids which comprises at least a 17 nucleotide fragment is GCTCAGTACAGCCACCT (SEQ ID NO: 5332).

In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 18 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 18 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one of the one to two ribonucleic acids which comprises at least a 18 nucleotide fragment is TGCTCAGTACAGCCACCT (SEQ ID NO: 5333).

In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one of the one to two ribonucleic acids which comprises at least a 19 nucleotide fragment is ATGCTCAGTACAGCCACCT (SEQ ID NO: 5334).

In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 20 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 20 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequences of the at least one of the one to two ribonucleic acids described above comprise at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one of the one to two ribonucleic acids which comprises at least a 20 nucleotide fragment is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324).

The present invention also contemplates multiplex genomic editing. Those skilled in the art will appreciate that the description above with respect to genomic editing of a single gene is equally applicable to the multiplex genomic editing embodiments described below.

In another aspect, the present invention provides a method for simultaneously altering multiple target polynucleotide sequences in a cell.

In some embodiments, a method for simultaneously altering multiple target polynucleotide sequences in a cell comprises a method for simultaneously altering multiple target SCID-associated polynucleotides in a cell.

An exemplary method for simultaneously altering multiple target SCID-asssociated polynucleotide sequences in a cell comprises contacting the SCID-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCID-associated polynucleotide sequences, wherein the target SCID-associated polynucleotide sequences are cleaved. In some embodiments, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%.

In some embodiments, a method for simultaneously altering multiple target polynucleotide sequences in a cell comprises a method for simultaneously altering multiple target SCD-associated polynucleotides in a cell.

An exemplary method for simultaneously altering multiple target SCD-asssociated polynucleotide sequences in a cell comprises contacting the SCD-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCD-associated polynucleotide sequences, wherein the target SCD-associated polynucleotide sequences are cleaved. In some embodiments, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%.

In some embodiments, a method for simultaneously altering multiple target polynucleotide sequences in a cell comprises a method for simultaneously altering multiple target beta thalassemia-associated polynucleotides in a cell.

An exemplary method for simultaneously altering multiple target beta thalassemia-associated polynucleotide sequences in a cell comprises contacting the polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target beta thalassemia-associated polynucleotide sequences, wherein the target beta thalassemia-associated polynucleotide sequences are cleaved. In some embodiments, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%.

In yet another aspect, the present invention provides a method for treating or preventing a disorder associated with expression of polynucleotide sequences in a subject.

In some embodiments, a method for treating or preventing a disorder associated with expression of polynucleotide sequences in a subject comprises a method for treating or preventing a disorder associated with expression of SCID-associated polynucleotide sequences in a subject.

An exemplary method for treating or preventing a disorder associated with expression of SCID-associated polynucleotide sequences in a subject comprises (a) altering target SCID-associated polynucleotide sequences in a cell ex vivo by contacting the SCID-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCID-associated polynucleotide sequences, wherein the target SCID-associated polynucleotide sequences are cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequences. In some embodiments, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%. In some embodiments, the method includes the step of contacting, before the step of introducing the cell into the subject, the cleaved SCID-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a corresponding wild-type or normal polynucleotide sequence, thereby allowing homology-directed repair to replace the cleaved SCID-associated polynucleotide sequence with the wild-type or normal gene sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise ADA polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal ADA polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise AK2 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal AK2 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise CD3D polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal CD3D polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise DCLRE1C polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal DCLRE1C polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise IL2RG polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal IL2RG polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise IL7R polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal IL7R polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise JAK3 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal JAK3 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise LIG4 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal LIG4 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise NHEJ1 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal NHEJ1 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise PNP polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal PNP polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise PRKDC polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal PRKDC polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise RAG1 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal RAG1 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise RAG2 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal RAG2 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise ZAP70 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal ZAP70 polynucleotide sequence.

In some embodiments, a method for treating or preventing a disorder associated with expression of polynucleotide sequences in a subject comprises a method for treating or preventing a disorder associated with expression of SCD-associated polynucleotide sequences in a subject.

An exemplary method for treating or preventing a disorder associated with expression of SCD-associated polynucleotide sequences in a subject comprises (a) altering target SCD-associated polynucleotide sequences in a cell ex vivo by contacting the SCD-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCD-associated polynucleotide sequences, wherein the target SCD-associated polynucleotide sequences are cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequences. In some embodiments, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%. In some embodiments, the method includes the step of contacting, before the step of introducing the cell into the subject, the cleaved SCD-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a normal HBB sequence, thereby allowing homology-directed repair to replace the cleaved SCD-associated polynucleotide sequence with the normal HBB sequence.

In some embodiments, a method for treating or preventing a disorder associated with expression of polynucleotide sequences in a subject comprises a method for treating or preventing a disorder associated with expression of beta thalassemia-associated polynucleotide sequences in a subject.

An exemplary method for treating or preventing a disorder associated with expression of beta thalassemia-associated polynucleotide sequences in a subject comprises (a) altering target beta thalassemia-associated polynucleotide sequences in a cell ex vivo by contacting the beta thalassemia-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target beta thalassemia-associated polynucleotide sequences, wherein the target beta thalassemia-associated polynucleotide sequences are cleaved, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequences. In some embodiments, the efficiency of alteration of cells that express Cas protein is from about 50% to about 80%,

In some embodiments, the method includes the step of contacting, before the step of introducing the cell into the subject, the cleaved beta thalassemia-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a normal HBB sequence, thereby allowing homology-directed repair to replace the cleaved beta thalassemia-associated polynucleotide sequence with the normal HBB sequence.

As used herein, the terms “administering,” “introducing” and “transplanting” are used interchangeably in the context of the placement of cells, e.g. cells described herein comprising a target polynucleotide sequence altered according to the methods of the invention into a subject, by a method or route which results in at least partial localization of the introduced cells at a desired site. The cells can be implanted directly to the desired site, or alternatively be administered by any appropriate route which results in delivery to a desired location in the subject where at least a portion of the implanted cells or components of the cells remain viable. The period of viability of the cells after administration to a subject can be as short as a few hours, e. g. twenty-four hours, to a few days, to as long as several years. In some instances, the cells can also be administered a location other than the desired site, such as in the liver or subcutaneously, for example, in a capsule to maintain the implanted cells at the implant location and avoid migration of the implanted cells.

For ex vivo methods, cells can include autologous cells, i.e., a cell or cells taken from a subject who is in need of altering a target polynucleotide sequence in the cell or cells (i.e., the donor and recipient are the same individual). Autologous cells have the advantage of avoiding any immunologically-based rejection of the cells. Alternatively, the cells can be heterologous, e.g., taken from a donor. The second subject can be of the same or different species. Typically, when the cells come from a donor, they will be from a donor who is sufficiently immunologically compatible with the recipient, i.e., will not be subject to transplant rejection, to lessen or remove the need for immunosuppression. In some embodiments, the cells are taken from a xenogeneic source, i.e., a non-human mammal that has been genetically engineered to be sufficiently immunologically compatible with the recipient, or the recipient's species. Methods for determining immunological compatibility are known in the art, and include tissue typing to assess donor-recipient compatibility for HLA and ABO determinants. See, e.g., Transplantation Immunology, Bach and Auchincloss, Eds. (Wiley, John & Sons, Incorporated 1994).

Any suitable cell culture media can be used for ex vivo methods of the invention.

Another exemplary method for treating or preventing a disorder associated with expression of SCID-associated polynucleotide sequences in a subject comprises altering target SCID-associated polynucleotide sequences in a cell by contacting the SCID-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target moieties of the target SCID-associated polynucleotide sequences, and wherein the target SCID-associated polynucleotide sequences are cleaved, thereby treating or preventing a disorder associated with expression of the polynucleotide sequences.

In some embodiments, the method includes the step of contacting the cleaved SCID-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a corresponding wild-type or normal polynucleotide sequence, thereby allowing homology-directed repair to replace the cleaved SCID-associated polynucleotide sequence with the corresponding wild-type or normal polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise ADA polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal ADA polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise AK2 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal AK2 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise CD3D polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal CD3D polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise DCLRE1C polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal DCLRE1C polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise IL2RG polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal IL2RG polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise IL7R polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal IL7R polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise JAK3 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal JAK3 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise LIG4 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal LIG4 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise NHEJ1 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal NHEJ1 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise PNP polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal PNP polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise PRKDC polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal PRKDC polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise RAG1 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal RAG1 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise RAG2 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal RAG2 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise ZAP70 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal ZAP70 polynucleotide sequence.

Another exemplary method for treating or preventing a disorder associated with expression of SCD-associated polynucleotide sequences in a subject comprises altering target SCD-associated polynucleotide sequences in a cell by contacting the SCD-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target moieties of the target SCD-associated polynucleotide sequences, and wherein the target SCD-associated polynucleotide sequences are cleaved, thereby treating or preventing a disorder associated with expression of the polynucleotide sequences.

In some embodiments, the method includes the step of contacting the cleaved SCD-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a normal HBB sequence, thereby allowing homology-directed repair to replace the cleaved SCD-associated polynucleotide sequence with the normal HBB sequence.

Another exemplary method for treating or preventing a disorder associated with expression of beta thalassemia-associated polynucleotide sequences in a subject comprises altering target beta thalassemia-associated polynucleotide sequences in a cell by contacting the beta thalassemia-associated polynucleotide sequences with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target moieties of the target beta thalassemia-associated polynucleotide sequences, and wherein the target beta thalassemia-associated polynucleotide sequences are cleaved, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequences.

In some embodiments, the method includes the step of contacting the cleaved beta thalassemia-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a normal HBB sequence, thereby allowing homology-directed repair to replace the cleaved beta thalassemia-associated polynucleotide sequence with the normal HBB sequence.

The terms “subject” and “individual” are used interchangeably herein, and refer to an animal, for example, a human from whom cells can be obtained and/or to whom treatment, including prophylactic treatment, with the cells as described herein, is provided. For treatment of those infections, conditions or disease states which are specific for a specific animal such as a human subject, the term subject refers to that specific animal. The “non-human animals” and “non-human mammals” as used interchangeably herein, includes mammals such as rats, mice, rabbits, sheep, cats, dogs, cows, pigs, and non-human primates. The term “subject” also encompasses any vertebrate including but not limited to mammals, reptiles, amphibians and fish. However, advantageously, the subject is a mammal such as a human, or other mammals such as a domesticated mammal, e.g. dog, cat, horse, and the like, or production mammal, e.g. cow, sheep, pig, and the like.

In some embodiments, the alteration results in reduced expression of the target polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target SCID-associated polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target ADA polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target AK2 polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target CD3D polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target DCLRE1C polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target IL2RG polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target IL7R polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target JAK3 polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target LIG4 polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target NHEJ1 polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target PNP polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target PRKDC polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target RAG1 polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target RAG2 polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target ZAP70 polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target SCD-associated polynucleotide sequences. In exemplary embodiments, the alteration results in reduced expression of the target beta thalassemia-associated polynucleotide sequences. In some embodiments, the alteration results in a knock out of the target polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target SCID-associated polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target ADA polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target AK2 polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target CD3D polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target DCLRE1C polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target IL2RG polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target IL7R polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target JAK3 polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target LIG4 polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target NHEJ1 polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target PNP polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target PRKDC polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target RAG1 polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target RAG2 polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target ZAP70 polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target SCD-associated polynucleotide sequences. In exemplary embodiments, the alteration results in a knock out of the target beta thalassemia-associated polynucleotide sequences.

In some embodiments, the alteration results in correction of the target polynucleotide sequences from undesired sequences to desired sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type polynucleotide sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type ADA sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type AK2 sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type CD3D sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type DCLRE1C sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type IL2RG sequences In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type IL7R sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type JAK3 sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type LIG4 sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type NHEJ1 sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type PNP sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type PRKDC sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type RAG1 sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type RAG2 sequences. In exemplary embodiments, the alteration results in correction of the target SCID-associated polynucleotide sequences to corresponding normal wild-type ZAP70 sequences. In exemplary embodiments, the alteration results in correction of the target SCD-associated polynucleotide sequences to normal wild-type HBB sequences. In exemplary embodiments, the alteration results in correction of the target beta thalassemia-associated polynucleotide sequences to normal wild-type HBB sequences. In some embodiments, each alteration is a homozygous alteration. In some embodiments, the efficiency of alteration at each loci is from about 5% to about 80%. In some embodiments, the efficiency of alteration at each loci is from about 10% to about 80%. In some embodiments, the efficiency of alteration at each loci is from about 30% to about 80%. In some embodiments, the efficiency of alteration at each loci is from about 50% to about 80%. In some embodiments, the efficiency of alteration at each loci is from greater than or equal to about 80%.

In some embodiments, each target polynucleotide sequence is cleaved such that a double-strand break results. In some embodiments, each target polynucleotide sequence is cleaved such that a single-strand break results.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of ADA (e.g., one or more mutations in the ADA gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of ADA.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of AK2 (e.g., one or more mutations in the AK2 gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of AK2.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of CD3D (e.g., one or more mutations in the CD3D gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of CD3D.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of DCLRE1C (e.g., one or more mutations in the DCLRE1C gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of DCLRE1C.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of IL2RG (e.g., one or more mutations in the IL2RG gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of IL2RG.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of IL7R (e.g., one or more mutations in the IL7R gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of IL7R.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of JAK3 (e.g., one or more mutations in the JAK3 gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of JAK3.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of LIG4 (e.g., one or more mutations in the LIG4 gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of LIG4.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of NHEJ1 (e.g., one or more mutations in the NHEJ1 gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of NHEJ1.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of PNP (e.g., one or more mutations in the PNP gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of PNP.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of PRKDC (e.g., one or more mutations in the PRKDC gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of PRKDC.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of RAG1 (e.g., one or more mutations in the RAG1 gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of RAG1.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of RAG2 (e.g., one or more mutations in the RAG2 gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of RAG2.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of ZAP70 (e.g., one or more mutations in the ZAP70 gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of ZAP70.

In some embodiments, the target polynucleotide sequences comprise multiple different portions of HBB (e.g., one or more mutations in the HBB gene). In some embodiments, the target polynucleotide sequences comprise at least a portion of HBB.

In some embodiments, each target motif is a 20-nucleotide DNA sequence. In some embodiments, each target motif is a 20-nucleotide DNA sequence beginning with G and immediately precedes an NGG motif recognized by the Cas protein. In some embodiments, each target motif is a 20-nucleotide DNA sequence and immediately precedes an NGG motif recognized by the Cas protein. In some embodiments, each target motif is G(N)19NGG. In some embodiments, each target motif is (N)20NGG. In some embodiments, each target motif is selected such that it contains at least two mismatches when compared with all other genomic nucleotide sequences in the cell. In some embodiments, each target motif is selected such that it contains at least two mismatches when compared with all other genomic nucleotide sequences in the cell.

In some embodiments, subsequent to cleavage of the target polynucleotide sequences, homology-directed repair occurs. In some embodiments, homology-directed repair is performed using an exogenously introduced DNA repair template. In some embodiments, the exogenously introduced DNA repair template is single-stranded. In some embodiments, the exogenously introduced DNA repair template is double-stranded.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type ADA sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type ADA sequence corresponding to the mutant ADA sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type AK2 sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type AK2 sequence corresponding to the mutant AK2 sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type CD3D sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type CD3D sequence corresponding to the mutant CD3D sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type DCLRE1C sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type DCLRE1C sequence corresponding to the mutant DCLRE1C sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type IL2RG sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type IL2RG sequence corresponding to the mutant IL2RG sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type IL7R sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type IL7R sequence corresponding to the mutant IL7R sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type JAK3 sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type JAK3 sequence corresponding to the mutant JAK3 sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type LIG4 sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type LIG4 sequence corresponding to the mutant LIG4 sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type NHEJ1 sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type NHEJ1 sequence corresponding to the mutant NHEJ1 sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type PNP sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type PNP sequence corresponding to the mutant PNP sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type PRKDC sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type PRKDC sequence corresponding to the mutant PRKDC sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type RAG1 sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type RAG1 sequence corresponding to the mutant RAG1 sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type RAG2 sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type RAG2 sequence corresponding to the mutant RAG2 sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type ZAP70 sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type ZAP70 sequence corresponding to the mutant ZAP70 sequence comprising the SCID-associated polynucleotide sequence.

In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type HBB sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type HBB sequence corresponding to the mutant HBB sequence comprising the SCD-associated polynucleotide sequence. In some embodiments, the exogenously introduced DNA repair template is a normal or wild-type HBB sequence corresponding to the mutant HBB sequence comprising the beta thalassemia-associated polynucleotide sequence.

In some embodiments, the Cas protein (e.g., Cas9) is complexed with the multiple ribonucleic acids. In some embodiments, the Cas protein and the multiple ribonucleic acids are contained in nanoparticles, as described herein. In some embodiments, the Cas protein and the multiple ribonucleic acids are contained in lipid nanoparticles, as described herein. In some embodiments, a nucleic acid encoding a Cas protein and the multiple ribonucleic acids are contained in nanoparticles. In some embodiments, a nucleic acid encoding a Cas protein and the multiple ribonucleic acids are contained in lipid nanoparticles, as described herein. In some embodiments, a modified, synthetic mRNA encoding a Cas protein as described herein, and multiple ribonucleic acids at least one of which comprises a modified, synthetic RNA as described herein, are contained in lipid nanoparticles. In some embodiments, a modified, synthetic mRNA encoding a Cas9 protein as described herein, and multiple ribonucleic acids at least one of which comprises a modified, synthetic RNA as described herein, are contained in lipid nanoparticles.

In some embodiments, the multiple ribonucleic acids are selected to minimize hybridization with nucleic acid sequences other than the target polynucleotide sequence (e.g., multiple alterations of a single target polynucleotide sequence). In some embodiments, the multiple ribonucleic acids are selected to minimize hybridization with nucleic acid sequences other than the target polynucleotide sequences (e.g., one or more alterations of multiple target polynucleotide sequences). In some embodiments, each of the multiple ribonucleic acids hybridize to target motifs that contain at least two mismatches when compared with all other genomic nucleotide sequences in the cell. In some embodiments, each of the multiple ribonucleic acids hybridize to target motifs that contain at least one mismatch when compared with all other genomic nucleotide sequences in the cell. In some embodiments, each of the multiple ribonucleic acids are designed to hybridize to target motifs immediately adjacent to deoxyribonucleic acid motifs recognized by the Cas protein. In some embodiments, each of the multiple ribonucleic acids are designed to hybridize to target motifs immediately adjacent to deoxyribonucleic acid motifs recognized by the Cas protein which flank mutant alleles located between the target motifs.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15. In some embodiments, each of the multiple ribonucleic acids comprises a sequence with a single nucleotide mismatch to a different sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15.

In some embodiments, each of the multiple ribonucleic acids comprises a different sequence selected from the group consisting of the ribonucleic acid sequences of FIGS. 1-15 and combinations thereof. In some embodiments, the different sequences of the multiple ribonucleic acids described above do not include the 3 nucleotide NGG sequence.

For example, if a target site sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a sequence of the multiple ribonucleic acids is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324). As another example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a sequence with a single nucleotide mismatch which does not include the 3 nucleotide NGG sequence is GATGCTGAGTACAGCCACCT (SEQ ID NO: 5326), with the italicized G being the mismatched nucleotide. Those skilled in the art will appreciate, however, that the single nucleotide mismatch can comprise any nucleotide in the ribonucleic acid, e.g., the first nucleotide, the second nucleotide, the third nucleotide, the fourth nucleotide, the fifth nucleotide, the sixth nucleotide, the seventh nucleotide, the eighth nucleotide, the ninth nucleotide, the tenth nucleotide, the eleventh nucleotide, the twelfth nucleotide, the thirteenth nucleotide, the fourteenth nucleotide, the fifteenth nucleotide, the sixteenth nucleotide, the seventeenth nucleotide, the eighteenth nucleotide, the nineteenth nucleotide, or the twentieth nucleotide of the ribonucleic acid.

In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 12 nucleotide fragments of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 12 nucleotide fragments sequences with single nucleotide mismatches to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if a target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a different sequence of the multiple ribonucleic acids comprising at least a 12 nucleotide fragment with single nucleotide mismatch comprises GTACAGCCACCT (SEQ ID NO: 5327).

In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 13 nucleotide fragments of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 13 nucleotide fragments sequences with single nucleotide mismatches to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if a target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a different sequence of the multiple ribonucleic acids comprising at least a 13 nucleotide fragment with single nucleotide mismatch comprises AGTACAGCCACCT (SEQ ID NO: 5328).

In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 14 nucleotide fragments of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 14 nucleotide fragments sequences with single nucleotide mismatches to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if a target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a different sequence of the multiple ribonucleic acids comprising at least a 14 nucleotide fragment with single nucleotide mismatch comprises CAGTACAGCCACCT (SEQ ID NO: 5329).

In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 15 nucleotide fragments of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 15 nucleotide fragments sequences with single nucleotide mismatches to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if a target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a different sequence of the multiple ribonucleic acids comprising at least a 15 nucleotide fragment with single nucleotide mismatch comprises TCAGTACAGCCACCT (SEQ ID NO: 5330).

In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 16 nucleotide fragments of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 16 nucleotide fragments sequences with single nucleotide mismatches to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if a target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a different sequence of the multiple ribonucleic acids comprising at least a 16 nucleotide fragment with single nucleotide mismatch comprises CTCAGTACAGCCACCT (SEQ ID NO: 5331).

In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 17 nucleotide fragments of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 17 nucleotide fragments sequences with single nucleotide mismatches to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if a target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a different sequence of the multiple ribonucleic acids comprising at least a 17 nucleotide fragment with single nucleotide mismatch comprises GCTCAGTACAGCCACCT (SEQ ID NO: 5332).

In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 18 nucleotide fragments of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 18 nucleotide fragments sequences with single nucleotide mismatches to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if a target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a different sequence of the multiple ribonucleic acids comprising at least a 18 nucleotide fragment with single nucleotide mismatch comprises TGCTCAGTACAGCCACCT (SEQ ID NO: 5333).

In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 19 nucleotide fragments of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 19 nucleotide fragments sequences with single nucleotide mismatches to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if a target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a different sequence of the multiple ribonucleic acids comprising at least a 19 nucleotide fragment with single nucleotide mismatch comprises ATGCTCAGTACAGCCACCT (SEQ ID NO: 5334).

In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 20 nucleotide fragments of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the different sequences of the multiple ribonucleic acids described above comprise at least 20 nucleotide fragments sequences with single nucleotide mismatches to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if a target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a different sequence of the multiple ribonucleic acids comprising at least a 12 nucleotide fragment with single nucleotide mismatch comprises GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324).

It should be appreciated that any of the Cas protein or the ribonucleic acids can be expressed from a plasmid. In some embodiments, any of the Cas protein or the ribonucleic acids are expressed using a promoter optimized for increased expression in stem cells (e.g., human stem cells). In some embodiments, the promoter is selected from the group consisting of a Cytomegalovirus (CMV) early enhancer element and a chicken beta-actin promoter, a chicken beta-actin promoter, an elongation factor-1 alpha promoter, and a ubiquitin promoter.

In some embodiments, the methods of the present invention further comprise selecting cells that express the Cas protein. The present invention contemplates any suitable method for selecting cells. In some embodiments, selecting cells comprises FACS. In some embodiments, FACs is used to select cells which co-express Cas and a fluorescent protein selected from the group consisting of green fluorescent protein and red fluorescent protein.

The present invention contemplates treating and/or preventing a variety of disorders which are associated with expression of a target polynucleotide sequences. It should be appreciated that the methods and compositions described herein can be used to treat or prevent disorders associated with increased expression of a target polynucleotide sequence, as well as decreased expression of a target polynucleotide sequence in a cell. Increased and decreased expression of a target polynucleotide sequence includes circumstances where the expression levels of the target polynucleotide sequence are increased or decreased, respectively, as well as circumstances in which the function and/or level of activity of an expression product of the target polynucleotide sequence increases or decreases, respectively, compared to normal expression and/or activity levels. Those skilled in the art will appreciate that treating or preventing a disorder associated with increased expression of a target polynucleotide sequence can be assessed by determining whether the levels and/or activity of the target polynucleotide sequence (or an expression product thereof) are decreased in a relevant cell after employing a method or administering a composition described herein. The skilled artisan will also appreciate that treating or preventing a disorder associated with decreased expression of a target polynucleotide sequence can be assessed by determining whether the levels and/or activity of the target polynucleotide sequence (or an expression product thereof) are increased in the relevant cell after employing a method or administering a composition described herein.

In some embodiments, the disorder is a genetic disorder. In some embodiments, the disorder is a monogenic disorder. In some embodiments, the disorder is a multigenic disorder. In some embodiments, the disorder is a disorder associated with one or more SNPs. Exemplary disorders associated with one or more SNPs include a complex disease described in U.S. Pat. No. 7,627,436, Alzheimer's disease as described in PCT International Application Publication No. WO/2009/112882, inflammatory diseases as described in U.S. Patent Application Publication No. 2011/0039918, polycystic ovary syndrome as described in U.S. Patent Application Publication No. 2012/0309642, cardiovascular disease as described in U.S. Pat. No. 7,732,139, Huntington's disease as described in U.S. Patent Application Publication No. 2012/0136039, thromboembolic disease as described in European Patent Application Publication No. EP2535424, neurovascular diseases as described in PCT International Application Publication No. WO/2012/001613, psychosis as described in U.S. Patent Application Publication No. 2010/0292211, multiple sclerosis as described in U.S. Patent Application Publication No. 2011/0319288, schizopherenia, schizoaffective disorder, and bipolar disorder as described in PCT International Application Publication No. WO/2006/023719A2, bipolar disorder and other ailments as described in U.S. Patent Application Publication No. U.s. 2011/0104674, colorectal cancer as described in PCT International Application Publication No. WO/2006/104370A1, a disorder associated with a SNP adjacent to the AKT1 gene locus as described in U.S. Patent Application Publication No. U.S. 2006/0204969, an eating disorder as described in PCT International Application Publication No. WO/2003/012143A1, autoimmune disease as described in U.S. Patent Application Publication No. U.S. 2007/0269827, fibrostenosing disease in patients with Chrohn's disease as described in U.S. Pat. No. 7,790,370, and Parkinson's disease as described in U.S. Pat. No. 8,187,811, each of which is incorporated herein by reference in its entirety. Other disorders associated with one or more SNPs which can be treated or prevented according to the methods of the present invention will be apparent to the skilled artisan.

In some embodiments, the disorder is severe combined immunodeficiency. In some embodiments, the disorder is X-linked moderate combined immunodeficiency. In some embodiments, the disorder is X-linked severe combined immunodeficiency. In some embodiments, the disorder is adenosine deaminase deficiency or SCID associated with adenosine deaminase deficiency. In some embodiments, the disorder is Athabascan type SCID. In some embodiments, the disorder is T cell-negative, B-cell/natural killer cell-positive SCID. In some embodiments, the disorder is T-negative/B-positive type autosomal recessive SCID. In some embodiments, the disorder is SCID with sensitivity to ionizing radiation. In some embodiments, the disorder is SCID with microcephaly, growth retardation, and sensitivity to ionizing radiation. In some embodiments, the disorder is reticular dysgenesis. In some embodiments, the disorder is LIG4 syndrome. In some embodiments, the disorder is alpha/beta T-cell lymphopenia with gamma/delta T-cell expansion, severe cytomegalovirus infection, and autoimmunity. In some embodiments, the disorder is combined cellular and humoral immune defects with granulomas. In some embodiments, the disorder is B cell-negative SCID. In some embodiments, the disorder is a selective T-cell defect or selective T-cell defect associated with SCID. In some embodiments, the disorder is purine nucleoside phosphorylase deficiency or SCID associated with purine nucleoside phosphorylase deficiency. In some embodiments, the disorder is Omenn syndrome. In some embodiments, the disorder is bare lymphocyte syndrome. In some embodiments, the disorder is SCID associated with JAK3 mutation. In some embodiments, the disorder is SCID associated with DCLRE1C mutation. In some embodiments, the disorder is a disorder listed in any one of the Gene Phenotype Relationship tables listed herein. In some embodiments, the disorder is sickle cell anemia. In some embodiments, the disorder is sickle cell disease. In some embodiments, the disorder is sickle cell anemia. In some embodiments, the disorder is sickle beta thalassemia. In some embodiments, the disorder is beta thalassemia.

The methods of the present invention are capable of altering target polynucleotide sequences in a variety of different cells. In some embodiments, the methods of the present invention are used to alter target polynucleotide sequences in cells ex vivo for subsequent introduction into a subject. In some embodiments, the methods of the present invention can be used to alter target polynucleotide sequences in cells in vivo. In some embodiments, the cell is a peripheral blood cell. In some embodiments, the cell is a stem cell or a pluripotent cell. In some embodiments, the cell is a hematopoietic stem cell. In some embodiments, the cell is a CD34+ cell. In some embodiments, the cell is a CD34+ mobilized peripheral blood cell. In some embodiments, the cell is a CD34+ cord blood cell. In some embodiments, the cell is a CD34+ bone marrow cell. In some embodiments, the cell is a CD34+CD38-Lineage-CD90+CD45RA− cell. In some embodiments, the cell is a hepatocyte. In some embodiments, the cell is a human pluripotent cell. In some embodiments, the cell is a primary human cell. In some embodiments, the cell is a non-transformed cell. In some embodiments, the cell is not a cancer cell. In some embodiments, the cell is not a tumor cell. In some embodiments, the cell is not a transformed cell.

In some aspects, the present invention provides a method for altering a target SCID-associated polynucleotide sequence in a cell comprising contacting the SCID-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, wherein the target SCID-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some aspects, the present invention provides a method for altering a target SCD-associated polynucleotide sequence in a cell comprising contacting the SCD-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCD-associated polynucleotide sequence, wherein the target SCD-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some aspects, the present invention provides a method for altering a target beta thalassemia-associated polynucleotide sequence in a cell comprising contacting the beta thalassemia-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target beta thalassemia-associated polynucleotide sequence, wherein the target beta thalassemia-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some aspects, the present invention provides a method for treating or preventing a disorder associated with expression of a SCID-associated polynucleotide sequence in a subject, the method comprising (a) altering a target SCID-associated polynucleotide sequence in a cell ex vivo by contacting the SCID-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, wherein the target SCID-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequence.

In some embodiments, the method includes the step of contacting, before the step of introducing the cell into the subject, the cleaved SCID-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a corresponding normal or wild-type polynucleotide sequence, thereby allowing homology-directed repair to replace the cleaved SCID-associated polynucleotide sequence with the corresponding normal or wild type polynucleotide sequence. In embodiments in which the target SCID-associated polynucleotide sequences comprise ADA polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal ADA polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise AK2 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal AK2 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise CD3D polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal CD3D polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise DCLRE1C polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal DCLRE1C polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise IL2RG polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal IL2RG polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise IL7R polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal IL7R polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise JAK3 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal JAK3 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise LIG4 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal LIG4 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise NHEJ1 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal NHEJ1 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise PNP polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal PNP polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise PRKDC polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal PRKDC polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise RAG1 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal RAG1 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise RAG2 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal RAG2 polynucleotide sequence.

In embodiments in which the target SCID-associated polynucleotide sequences comprise ZAP70 polynucleotide sequences, the exogenously introduced DNA repair template comprises a corresponding wild-type or normal ZAP70 polynucleotide sequence.

In some aspects, the present invention provides a method for treating or preventing a disorder associated with expression of a SCD-associated polynucleotide sequence in a subject, the method comprising (a) altering a target SCD-associated polynucleotide sequence in a cell ex vivo by contacting the SCD-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCD-associated polynucleotide sequence, wherein the target SCD-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequence.

In some embodiments, the method includes the step of contacting, before the step of introducing the cell into the subject, the cleaved SCD-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a normal HBB sequence, thereby allowing homology-directed repair to replace the cleaved SCD-associated polynucleotide sequence with the normal HBB sequence.

In some aspects, the present invention provides a method for treating or preventing a disorder associated with expression of a beta thalassemia-associated polynucleotide sequence in a subject, the method comprising (a) altering a target beta thalassemia-associated polynucleotide sequence in a cell ex vivo by contacting the beta thalassemia-associated polynucleotide sequence in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target beta thalassemia-associated polynucleotide sequence, wherein the target beta thalassemia-associated polynucleotide sequence is cleaved, and wherein the efficiency of alteration is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequence.

In some embodiments, the method includes the step of contacting, before the step of introducing the cell into the subject, the cleaved beta thalassemia-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a normal HBB sequence, thereby allowing homology-directed repair to replace the cleaved beta thalassemia-associated polynucleotide sequence with the normal HBB sequence.

In some aspects, the present invention provides a method for simultaneously altering multiple target SCID-associated polynucleotide sequences in a cell comprising contacting the SCID-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCID-associated polynucleotide sequences, wherein the target SCD-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some embodiments, the method includes the step of contacting the cleaved target SCID-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a corresponding normal or wild-type sequence (e.g., of a ADA gene, a AK2 gene, a CD3D gene, a DCLRE1C gene, a IL2RG gene, IL7R gene, a LIG4 gene, a NHEJ1 gene, a PNP gene, a PRKDC gene, a RAG1 gene, a RAG2 gene, a ZAP70 gene), thereby allowing homology-directed repair to replace the mutant portion of the cleaved SCID-associated polynucleotide sequence with the corresponding normal or wild-type sequence (e.g., of the ADA gene, the AK2 gene, the CD3D gene, the DCLRE1C gene, the IL2RG gene, the IL7R gene, the LIG4 gene, the NHEJ1 gene, the PNP gene, the PRKDC gene, the RAG1 gene, the RAG2 gene, the ZAP70 gene).

In some aspects, the present invention provides a method for simultaneously altering multiple target SCD-associated polynucleotide sequences in a cell comprising contacting the SCD-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCD-associated polynucleotide sequences, wherein the target SCD-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some embodiments, the method includes the step of contacting the cleaved target SCD-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a normal HBB sequence, thereby allowing homology-directed repair to replace the mutant portion of the cleaved SCD-associated polynucleotide sequence with the normal HBB sequence.

In some aspects, the present invention provides a method for simultaneously altering multiple target beta thalassemia-associated polynucleotide sequences in a cell comprising contacting the beta thalassemia-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target beta thalassemia-associated polynucleotide sequences, wherein the target beta thalassemia-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%.

In some embodiments, the method includes the step of contacting the cleaved target beta thalassemia-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a normal HBB sequence, thereby allowing homology-directed repair to replace the mutant portion of the cleaved beta thalassemia-associated polynucleotide sequence with the normal HBB sequence.

In some aspects, the present invention provides a method for treating or preventing a disorder associated with expression of SCID-associated polynucleotide sequences in a subject, the method comprising (a) altering target SCID-associated polynucleotide sequences in a cell ex vivo by contacting the SCD-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCID-associated polynucleotide sequences, wherein the target SCID-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCID-associated polynucleotide sequences.

In some embodiments, the method includes the step of contacting, before the step of introducing the cell into the subject, the cleaved SCID-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a corresponding normal or wild-type sequence (e.g., of a ADA gene, a AK2 gene, a CD3D gene, a DCLRE1C gene, a IL2RG gene, a IL7R gene, a LIG4 gene, a NHEJ1 gene, a PNP gene, a PRKDC gene, a RAG1 gene, a RAG2 gene, a ZAP70 gene), thereby allowing homology-directed repair to replace the cleaved SCD-associated polynucleotide sequence with the normal or wild-type sequence (e.g., of the ADA gene, the AK2 gene, the CD3D gene, the DCLRE1C gene, the IL2RG gene, the IL7R gene, the LIG4 gene, the NHEJ1 gene, the PNP gene, the PRKDC gene, the RAG1 gene, the RAG2 gene, the ZAP70 gene).

In some aspects, the present invention provides a method for treating or preventing a disorder associated with expression of SCD-associated polynucleotide sequences in a subject, the method comprising (a) altering target SCD-associated polynucleotide sequences in a cell ex vivo by contacting the SCD-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target SCD-associated polynucleotide sequences, wherein the target SCD-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the SCD-associated polynucleotide sequences.

In some embodiments, the method includes the step of contacting, before the step of introducing the cell into the subject, the cleaved SCD-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a normal HBB sequence, thereby allowing homology-directed repair to replace the cleaved SCD-associated polynucleotide sequence with the normal HBB sequence.

In some aspects, the present invention provides a method for treating or preventing a disorder associated with expression of beta thalassemia-associated polynucleotide sequences in a subject, the method comprising (a) altering target beta thalassemia-associated polynucleotide sequences in a cell ex vivo by contacting the beta thalassemia-associated polynucleotide sequences in a cell selected from the group consisting of a human pluripotent cell, a primary human cell, and a non-transformed human cell, with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and multiple ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to target motifs of the target beta thalassemia-associated polynucleotide sequences, wherein the target beta thalassemia-associated polynucleotide sequences are cleaved, and wherein the efficiency of alteration of cells that express Cas protein is from about 8% to about 80%, and (b) introducing the cell into the subject, thereby treating or preventing a disorder associated with expression of the beta thalassemia-associated polynucleotide sequences.

In some embodiments, the method includes the step of contacting, before the step of introducing the cell into the subject, the cleaved beta thalassemia-associated polynucleotide sequences with an exogenously introduced DNA repair template comprising a normal HBB sequence, thereby allowing homology-directed repair to replace the cleaved beta thalassemia-associated polynucleotide sequence with the normal HBB sequence.

The present invention also provides compositions comprising Cas proteins of the present invention or functional portions thereof, nucleic acids encoding the Cas proteins or functional portions thereof, and ribonucleic acid sequences which direct Cas proteins to and hybridize to target motifs of target polynucleotides in a cell.

For administration to a subject, a composition as disclosed herein can be administered to a subject, for example in pharmaceutically acceptable compositions. Pharmaceutically acceptable compositions comprise a therapeutically-effective amount of a Cas protein of the present invention or functional portion thereof, nucleic acids encoding the Cas proteins (e.g., modified, synthetic mRNA), and ribonucleic acid sequences which direct Cas proteins to and hybridize, formulated together with one or more pharmaceutically acceptable carriers (additives) and/or diluents.

As described in detail below, the pharmaceutical compositions of the present invention can be specially formulated for administration in solid or liquid form, including those adapted for the following: (1) oral administration, for example, drenches (aqueous or non-aqueous solutions or suspensions), lozenges, dragees, capsules, pills, tablets (e.g., those targeted for buccal, sublingual, and systemic absorption), boluses, powders, granules, pastes for application to the tongue; (2) parenteral administration, for example, by subcutaneous, intramuscular, intravenous or epidural injection as, for example, a sterile solution or suspension, or sustained-release formulation; (3) topical application, for example, as a cream, ointment, or a controlled-release patch or spray applied to the skin; (4) intravaginally or intrarectally, for example, as a pessary, cream or foam; (5) sublingually; (6) ocularly; (7) transdermally; (8) transmucosally; or (9) nasally. Additionally, compounds can be implanted into a patient or injected using a drug delivery system. See, for example, Urquhart, et al., Ann. Rev. Pharmacol. Toxicol. 24: 199-236 (1984); Lewis, ed. “Controlled Release of Pesticides and Pharmaceuticals” (Plenum Press, New York, 1981); U.S. Pat. No. 3,773,919; and U.S. Pat. No. 35 3,270,960.

As used here, the term “pharmaceutically acceptable” refers to those compounds, materials, compositions, and/or dosage forms which are, within the scope of sound medical judgment, suitable for use in contact with the tissues of human beings and animals without excessive toxicity, irritation, allergic response, or other problem or complication, commensurate with a reasonable benefit/risk ratio.

As used here, the term “pharmaceutically-acceptable carrier” means a pharmaceutically-acceptable material, composition or vehicle, such as a liquid or solid filler, diluent, excipient, manufacturing aid (e.g., lubricant, talc magnesium, calcium or zinc stearate, or steric acid), or solvent encapsulating material, involved in carrying or transporting the subject compound from one organ, or portion of the body, to another organ, or portion of the body. Each carrier must be “acceptable” in the sense of being compatible with the other ingredients of the formulation and not injurious to the patient. Some examples of materials which can serve as pharmaceutically-acceptable carriers include: (1) sugars, such as lactose, glucose and sucrose; (2) starches, such as corn starch and potato starch; (3) cellulose, and its derivatives, such as sodium carboxymethyl cellulose, methylcellulose, ethyl cellulose, microcrystalline cellulose and cellulose acetate; (4) powdered tragacanth; (5) malt; (6) gelatin; (7) lubricating agents, such as magnesium stearate, sodium lauryl sulfate and talc; (8) excipients, such as cocoa butter and suppository waxes; (9) oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, corn oil and soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerin, sorbitol, mannitol and polyethylene glycol (PEG); (12) esters, such as ethyl oleate and ethyl laurate; (13) agar; (14) buffering agents, such as magnesium hydroxide and aluminum hydroxide; (15) alginic acid; (16) pyrogen-free water; (17) isotonic saline; (18) Ringer's solution; (19) ethyl alcohol; (20) pH buffered solutions; (21) polyesters, polycarbonates and/or polyanhydrides; (22) bulking agents, such as polypeptides and amino acids (23) serum component, such as serum albumin, HDL and LDL; (22) C2-C12 alchols, such as ethanol; and (23) other non-toxic compatible substances employed in pharmaceutical formulations. Wetting agents, coloring agents, release agents, coating agents, sweetening agents, flavoring agents, perfuming agents, preservative and antioxidants can also be present in the formulation. The terms such as “excipient”, “carrier”, “pharmaceutically acceptable carrier” or the like are used interchangeably herein.

The phrase “therapeutically-effective amount” as used herein in respect to a Cas protein and/or ribonucleic acids described herein means that amount of relevant protein and/or ribonucleic acid, or composition comprising the same which is effective for producing some desired therapeutic effect in at least a sub-population of cells in an animal at a reasonable benefit/risk ratio applicable to any medical treatment. For example, an amount administered to a subject that is sufficient to produce a statistically significant, measurable change in at least one symptom of sickle cell anemia (e.g., reduced red blood cell count.). Determination of a therapeutically effective amount is well within the capability of those skilled in the art. Generally, a therapeutically effective amount can vary with the subject's history, age, condition, sex, as well as the severity and type of the medical condition in the subject, and administration of other pharmaceutically active agents.

As used herein, the term “administer” refers to the placement of a composition into a subject by a method or route which results in at least partial localization of the composition at a desired site such that desired effect is produced. A compound or composition described herein can be administered by any appropriate route known in the art including, but not limited to, oral or parenteral routes, including intravenous, intramuscular, subcutaneous, transdermal, airway (aerosol), pulmonary, nasal, rectal, and topical (including buccal and sublingual) administration.

Exemplary modes of administration include, but are not limited to, injection, infusion, instillation, inhalation, or ingestion. “Injection” includes, without limitation, intravenous, intramuscular, intraarterial, intrathecal, intraventricular, intracapsular, intraorbital, intracardiac, intradermal, intraperitoneal, transtracheal, subcutaneous, subcuticular, intraarticular, sub capsular, subarachnoid, intraspinal, intracerebro spinal, and intrasternal injection and infusion. In preferred embodiments, the compositions are administered by intravenous infusion or injection.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15. In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15.

In some aspects, the present invention provides a composition comprising at least one ribonucleic acid having a ribonucleic acid sequences of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some aspects, the present invention provides a composition comprising at least one ribonucleic acid comprising a sequence with a single nucleotide mismatch to ribonucleic acid sequences of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the at least one ribonucleic acid sequences described above do not include the 3 nucleotide NGG sequence.

For example, if the target site sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid sequence is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324). As another example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a ribonucleic acid sequence with a single nucleotide mismatch which does not include the 3 nucleotide NGG sequence is GATGCTGAGTACAGCCACCT (SEQ ID NO: 5326), with the italicized G being the mismatched nucleotide. Those skilled in the art will appreciate, however, that the single nucleotide mismatch can comprise any nucleotide in the ribonucleic acid, e.g., the first nucleotide, the second nucleotide, the third nucleotide, the fourth nucleotide, the fifth nucleotide, the sixth nucleotide, the seventh nucleotide, the eighth nucleotide, the ninth nucleotide, the tenth nucleotide, the eleventh nucleotide, the twelfth nucleotide, the thirteenth nucleotide, the fourteenth nucleotide, the fifteenth nucleotide, the sixteenth nucleotide, the seventeenth nucleotide, the eighteenth nucleotide, the nineteenth nucleotide, or the twentieth nucleotide of the ribonucleic acid.

In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of any of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 12 nucleotide fragment is GTACAGCCACCT (SEQ ID NO: 5327).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 13 nucleotide fragment is AGTACAGCCACCT (SEQ ID NO: 5328).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 14 nucleotide fragment is CAGTACAGCCACCT (SEQ ID NO: 5329).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 15 nucleotide fragment is TCAGTACAGCCACCT (SEQ ID NO: 5330).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 16 nucleotide fragment is CTCAGTACAGCCACCT (SEQ ID NO: 5331).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 17 nucleotide fragment is GCTCAGTACAGCCACCT (SEQ ID NO: 5332).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 18 nucleotide fragment is TGCTCAGTACAGCCACCT (SEQ ID NO: 5333).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 19 nucleotide fragment is ATGCTCAGTACAGCCACCT (SEQ ID NO: 5334).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 20 nucleotide fragment is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324).

In some embodiments, the at least one ribonucleic acid in the composition is contained in a nanoparticle. In some embodiments, the at least one ribonucleic acid is contained in a lipid nanoparticle. In some embodiments, the lipid nanoparticle comprises at least one of a cationic lipid, a neutral lipid, an amino lipid, a sterol, and a PEG or PEG-modified lipid.

In some embodiments, at least one of the ribonucleic acids in the composition is a modified ribonucleic acid as described herein (e.g., a synthetic, modified ribonucleic acid, e.g., comprising one to two modified nucleotides selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate, or any other modified nucleotides or modifications described herein).

In some embodiments, a composition of the present invention comprises a nucleic acid sequence encoding a Cas protein. In some embodiments, a composition of the present invention comprises nucleic acid sequence encoding Cas9 protein or a functional portion thereof.

In some embodiments, the nucleic acid encoding the Cas protein (e.g., Cas9) comprises a modified ribonucleic acid as described herein (e.g., a synthetic, modified mRNA described herein, e.g., comprising at least one modified nucleotide selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate or any other modified nucleotides or modifications described herein).

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid having a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321).

In some embodiments, the at least one additional ribonucleic acid sequences described above do not include the 3 nucleotide NGG sequence.

For example, if the target site sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324). As another example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a ribonucleic acid sequence with a single nucleotide mismatch which does not include the 3 nucleotide NGG sequence is GATGCTGAGTACAGCCACCT (SEQ ID NO: 5326), with the italicized G being the mismatched nucleotide. Those skilled in the art will appreciate, however, that the single nucleotide mismatch can comprise any nucleotide in the ribonucleic acid, e.g., the first nucleotide, the second nucleotide, the third nucleotide, the fourth nucleotide, the fifth nucleotide, the sixth nucleotide, the seventh nucleotide, the eighth nucleotide, the ninth nucleotide, the tenth nucleotide, the eleventh nucleotide, the twelfth nucleotide, the thirteenth nucleotide, the fourteenth nucleotide, the fifteenth nucleotide, the sixteenth nucleotide, the seventeenth nucleotide, the eighteenth nucleotide, the nineteenth nucleotide, or the twentieth nucleotide of the ribonucleic acid.

In some embodiments, the at least one additional ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one additional ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one additional ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of any of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the at least one additional ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 12 nucleotide fragment is GTACAGCCACCT (SEQ ID NO: 5327).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 13 nucleotide fragment is AGTACAGCCACCT (SEQ ID NO: 5328).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 14 nucleotide fragment is CAGTACAGCCACCT (SEQ ID NO: 5329).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 15 nucleotide fragment is TCAGTACAGCCACCT (SEQ ID NO: 5330).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 16 nucleotide fragment is CTCAGTACAGCCACCT (SEQ ID NO: 5331).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 17 nucleotide fragment is GCTCAGTACAGCCACCT (SEQ ID NO: 5332).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 18 nucleotide fragment is TGCTCAGTACAGCCACCT (SEQ ID NO: 5333).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 19 nucleotide fragment is ATGCTCAGTACAGCCACCT (SEQ ID NO: 5334).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 20 nucleotide fragment is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324).

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15.

In some aspects, the present invention provides a composition comprising a chimeric nucleic acid comprising a ribonucleic acid encoding a Cas protein and at least one additional ribonucleic acid sequence comprising a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321).

In some embodiments, the at least one additional ribonucleic acid sequences described above do not include the 3 nucleotide NGG sequence. For example, if the target site sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324). As another example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a ribonucleic acid sequence with a single nucleotide mismatch which does not include the 3 nucleotide NGG sequence is GATGCTGAGTACAGCCACCT (SEQ ID NO: 5326), with the italicized G being the mismatched nucleotide. Those skilled in the art will appreciate, however, that the single nucleotide mismatch can comprise any nucleotide in the ribonucleic acid, e.g., the first nucleotide, the second nucleotide, the third nucleotide, the fourth nucleotide, the fifth nucleotide, the sixth nucleotide, the seventh nucleotide, the eighth nucleotide, the ninth nucleotide, the tenth nucleotide, the eleventh nucleotide, the twelfth nucleotide, the thirteenth nucleotide, the fourteenth nucleotide, the fifteenth nucleotide, the sixteenth nucleotide, the seventeenth nucleotide, the eighteenth nucleotide, the nineteenth nucleotide, or the twentieth nucleotide of the ribonucleic acid.

In some embodiments, the at least one additional ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one additional ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one additional ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the at least one additional ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 12 nucleotide fragment is GTACAGCCACCT (SEQ ID NO: 5327).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 13 nucleotide fragment is AGTACAGCCACCT (SEQ ID NO: 5328).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 14 nucleotide fragment is CAGTACAGCCACCT (SEQ ID NO: 5329).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 15 nucleotide fragment is TCAGTACAGCCACCT (SEQ ID NO: 5330).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 16 nucleotide fragment is CTCAGTACAGCCACCT (SEQ ID NO: 5331).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 17 nucleotide fragment is GCTCAGTACAGCCACCT (SEQ ID NO: 5332).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 18 nucleotide fragment is TGCTCAGTACAGCCACCT (SEQ ID NO: 5333).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 19 nucleotide fragment is ATGCTCAGTACAGCCACCT (SEQ ID NO: 5334).

In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one additional ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one additional ribonucleic acid which comprises at least a 20 nucleotide fragment is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324).

In some embodiments, a composition of the present invention comprises a nucleic acid sequence encoding a fluorescent protein selected from the group consisting of green fluorescent protein and red fluorescent protein. In some embodiments, a composition of the present invention comprises a promoter operably linked to the chimeric nucleic acid. In some embodiments, the promoter is optimized for increased expression in human stem cells. In some embodiments, the promoter is selected from the group consisting of a Cytomegalovirus (CMV) early enhancer element and a chicken beta-actin promoter, a chicken beta-actin promoter, an elongation factor-1 alpha promoter, and a ubiquitin promoter.

In some embodiments, the chimeric nucleic acid is contained in a nanoparticle. In some embodiments, the chimeric nucleic acid is contained in a lipid nanoparticle as described herein. In some embodiments, the chimeric nucleic acid comprises at least one modified nucleotide described herein. In some embodiments, the Cas protein comprises a Cas9 protein or a functional portion thereof.

For in vivo methods, a therapeutically effective amount of a composition described herein can be administered to a subject. Methods of administering compositions to a subject are known in the art and easily available to one of skill in the art.

In some embodiments, a composition described herein includes one or more additional pharmaceutically active agents for treating or preventing the disorder associated with expression of the target polynucleotide sequence.

The present invention also provides kits for practicing any of the methods of the present invention, as well as kits comprising the compositions of the present invention, and instructions for using the kits for altering target polynucleotide sequences in a cell.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence having a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321).

In some embodiments, the at least one ribonucleic acid sequences described above do not include the 3 nucleotide NGG sequence. For example, if the target site sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid sequence is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324). As another example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a ribonucleic acid sequence with a single nucleotide mismatch which does not include the 3 nucleotide NGG sequence is GATGCTGAGTACAGCCACCT (SEQ ID NO: 5326), with the italicized G being the mismatched nucleotide. Those skilled in the art will appreciate, however, that the single nucleotide mismatch can comprise any nucleotide in the ribonucleic acid, e.g., the first nucleotide, the second nucleotide, the third nucleotide, the fourth nucleotide, the fifth nucleotide, the sixth nucleotide, the seventh nucleotide, the eighth nucleotide, the ninth nucleotide, the tenth nucleotide, the eleventh nucleotide, the twelfth nucleotide, the thirteenth nucleotide, the fourteenth nucleotide, the fifteenth nucleotide, the sixteenth nucleotide, the seventeenth nucleotide, the eighteenth nucleotide, the nineteenth nucleotide, or the twentieth nucleotide of the ribonucleic acid.

In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 12 nucleotide fragment is GTACAGCCACCT (SEQ ID NO: 5327).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to f a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 13 nucleotide fragment is AGTACAGCCACCT (SEQ ID NO: 5328).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 14 nucleotide fragment is CAGTACAGCCACCT (SEQ ID NO: 5329).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 15 nucleotide fragment is TCAGTACAGCCACCT (SEQ ID NO: 5330).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 16 nucleotide fragment is CTCAGTACAGCCACCT (SEQ ID NO: 5331).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 17 nucleotide fragment is GCTCAGTACAGCCACCT (SEQ ID NO: 5332).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 18 nucleotide fragment is TGCTCAGTACAGCCACCT (SEQ ID NO: 5333).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 19 nucleotide fragment is ATGCTCAGTACAGCCACCT (SEQ ID NO: 5334).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 20 nucleotide fragment is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324).

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 1, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 2, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 3, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 4, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 5, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 6, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 7, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 8, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 9, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 10, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 11, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 12, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 13, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 14, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence selected from the group consisting of the ribonucleic acid sequences of FIG. 15, and any combination thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321).

In some embodiments, the at least one ribonucleic acid sequences with the single nucleotide mismatches described above do not include the 3 nucleotide NGG sequence. For example, if the target site sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid sequence is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324). As another example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a ribonucleic acid sequence with a single nucleotide mismatch which does not include the 3 nucleotide NGG sequence is GATGCTGAGTACAGCCACCT (SEQ ID NO: 5326), with the italicized G being the mismatched nucleotide. Those skilled in the art will appreciate, however, that the single nucleotide mismatch can comprise any nucleotide in the ribonucleic acid, e.g., the first nucleotide, the second nucleotide, the third nucleotide, the fourth nucleotide, the fifth nucleotide, the sixth nucleotide, the seventh nucleotide, the eighth nucleotide, the ninth nucleotide, the tenth nucleotide, the eleventh nucleotide, the twelfth nucleotide, the thirteenth nucleotide, the fourteenth nucleotide, the fifteenth nucleotide, the sixteenth nucleotide, the seventeenth nucleotide, the eighteenth nucleotide, the nineteenth nucleotide, or the twentieth nucleotide of the ribonucleic acid.

In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 12 nucleotide fragment is GTACAGCCACCT (SEQ ID NO: 5327).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 13 nucleotide fragment is AGTACAGCCACCT (SEQ ID NO: 5328).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 14 nucleotide fragment is CAGTACAGCCACCT (SEQ ID NO: 5329).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 15 nucleotide fragment is TCAGTACAGCCACCT (SEQ ID NO: 5330).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 16 nucleotide fragment is CTCAGTACAGCCACCT (SEQ ID NO: 5331).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 17 nucleotide fragment is GCTCAGTACAGCCACCT (SEQ ID NO: 5332).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 18 nucleotide fragment is TGCTCAGTACAGCCACCT (SEQ ID NO: 5333).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 19 nucleotide fragment is ATGCTCAGTACAGCCACCT (SEQ ID NO: 5334).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 20 nucleotide fragment is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324).

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 1 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 1, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 2 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 2, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 3 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 3, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 4 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 4, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 5 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 5, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 6 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 6, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 7 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 7, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 8 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 8, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 9 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 9, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 10 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 10, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 11 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 11, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 12 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 12, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 13 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 13, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 14 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 14, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of FIG. 15 and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of FIG. 15, and combinations thereof.

In some aspects, the present invention comprises a kit for altering a target polynucleotide sequence in a cell comprising a Cas9 protein or a nucleic acid encoding the Cas9 protein, and at least one ribonucleic acid sequence selected from the group consisting of at least one ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321) and at least one ribonucleic acid sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321).

In some embodiments, the at least one ribonucleic acid sequences described above do not include the 3 nucleotide NGG sequence. For example, if the target site sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid sequence is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324). As another example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), a ribonucleic acid sequence with a single nucleotide mismatch which does not include the 3 nucleotide NGG sequence is GATGCTGAGTACAGCCACCT (SEQ ID NO: 5326), with the italicized G being the mismatched nucleotide. Those skilled in the art will appreciate, however, that the single nucleotide mismatch can comprise any nucleotide in the ribonucleic acid, e.g., the first nucleotide, the second nucleotide, the third nucleotide, the fourth nucleotide, the fifth nucleotide, the sixth nucleotide, the seventh nucleotide, the eighth nucleotide, the ninth nucleotide, the tenth nucleotide, the eleventh nucleotide, the twelfth nucleotide, the thirteenth nucleotide, the fourteenth nucleotide, the fifteenth nucleotide, the sixteenth nucleotide, the seventeenth nucleotide, the eighteenth nucleotide, the nineteenth nucleotide, or the twentieth nucleotide of the ribonucleic acid.

In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 12 nucleotide fragment is GTACAGCCACCT (SEQ ID NO: 5327).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 13 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 13 nucleotide fragment is AGTACAGCCACCT (SEQ ID NO: 5328).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 14 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 14 nucleotide fragment is CAGTACAGCCACCT (SEQ ID NO: 5329).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 15 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 15 nucleotide fragment is TCAGTACAGCCACCT (SEQ ID NO: 5330).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 16 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 16 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 16 nucleotide fragment is CTCAGTACAGCCACCT (SEQ ID NO: 5331).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 17 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 17 nucleotide fragment is GCTCAGTACAGCCACCT (SEQ ID NO: 5332).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 18 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 18 nucleotide fragment is TGCTCAGTACAGCCACCT (SEQ ID NO: 5333).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprise at least a 19 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 19 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 19 nucleotide fragment is ATGCTCAGTACAGCCACCT (SEQ ID NO: 5334).

In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a ribonucleic acid sequence of any of FIGS. 1-15. In some embodiments, the ribonucleic acid sequence of the at least one ribonucleic acid described above comprises at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a sequence selected from the group consisting of a ribonucleic acid sequence of any of FIGS. 1-15. For example, if the target sequence is GATGCTCAGTACAGCCACCTTGG (SEQ ID NO: 5323), the ribonucleic acid sequence of the at least one ribonucleic acid which comprises at least a 20 nucleotide fragment is GATGCTCAGTACAGCCACCT (SEQ ID NO: 5324).

In some embodiments, the at least one ribonucleic acid described above comprises at least a 12 nucleotide fragment, at least a 13 nucleotide fragment, at least a 14 nucleotide fragment, at least a 15 nucleotide fragment, at least a 16 nucleotide fragment, at least a 17 nucleotide fragment, at least an 18 nucleotide fragment, at least a 19 nucleotide fragment, or at least a 20 nucleotide sequence of a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321). In some embodiments, the at least one ribonucleic acid described above comprises at least a 12, at least a 13, at least a 14, at least a 15, at least a, at least a 17, at least an 18, at least a 19, or at least a 20 nucleotide fragment of a sequence with a single nucleotide mismatch to a ribonucleic acid sequence of GTAACGGCAGACTTCTCCACAGG (SEQ ID NO: 5321).

In some embodiments, the kit comprises one or more cell lines, cultures, or populations selected from the group consisting of human pluripotent cells, primary human cells, and non-transformed cells. In some embodiments, the kit comprises a DNA repair template.

In some embodiments, the DNA repair template comprises one or more normal or wild-type ADA gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant ADA sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type AK2 gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant AK2 sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type CD3D gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant CD3D sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type DCLRE1C gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant DCLRE1C sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type IL2RG gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant IL2RG sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type IL7R gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant IL7R sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type JAK3 gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant JAK3 sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type NHEJ1 gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant NHEJ1 sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type PNP gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant PNP sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type PRKDC gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant PRKDC sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type RAG1 gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant RAG1 sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type RAG2 gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant RAG2 sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type ZAP70 gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant ZAP70 sequences to be cleaved from target SCID-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type HBB gene sequences. In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant HBB sequences to be cleaved from target SCD-associated polynucleotide sequences.

In some embodiments, the DNA repair template comprises one or more normal or wild-type DNA sequences that correspond to mutant HBB sequences to be cleaved from target beta thalassemia-associated polynucleotide sequences.

It should be appreciated that the methods, compositions, and kits of the present invention may employ nanoparticles or lipid nanoparticles as a vehicle for delivering, or introducing a Cas protein and/or a ribonucleic acid of the present invention into a cell.

In some embodiments, the lipid nanoparticle comprises at least one of a cationic lipid, a neutral lipid, an amino lipid, a sterol, and a PEG or PEG-modified lipid.

The cationic lipid may be, for example, N,N-dioleyl-N,N-dimethylammonium chloride (DODAC), N,N-distearyl-N,N-dimethylammonium bromide (DDAB), N-(I-(2,3-dioleoyloxy)propyl)-N,N,N-trimethylammonium chloride (DOTAP), N-(I-(2,3-dioleyloxy)propyl)-N,N,N-trimethylammonium chloride (DOTMA), N,N-dimethyl-2,3-dioleyloxy)propylamine (DODMA), 1,2-DiLinoleyloxy-N,N-dimethylaminopropane (DLinDMA), 1,2-Dilinolenyloxy-N,N-dimethylaminopropane (DLenDMA), 1,2-Dilinoleylcarbamoyloxy-3-dimethylaminopropane (DLin-C-DAP), 1,2-Dilinoleyoxy-3-(dimethylamino)acetoxypropane (DLin-DAC), 1,2-Dilinoleyoxy-3-morpholinopropane (DLin-MA), 1,2-Dilinoleoyl-3-dimethylaminopropane (DLinDAP), 1,2-Dilinoleylthio-3-dimethylaminopropane (DLin-S-DMA), 1-Linoleoyl-2-linoleyloxy-3-dimethylaminopropane (DLin-2-DMAP), 1,2-Dilinoleyloxy-3-trimethylaminopropane chloride salt (DLin-TMA.Cl), 1,2-Dilinoleoyl-3-trimethylaminopropane chloride salt (DLin-TAP.Cl), 1,2-Dilinoleyloxy-3-(N-methylpiperazino)propane (DLin-MPZ), or 3-(N,N-Dilinoleylamino)-1,2-propanediol (DLinAP), 3-(N,N-Dioleylamino)-1,2-propanedio (DOAP), 1,2-Dilinoleyloxo-3-(2-N,N-dimethylamino)ethoxypropane (DLin-EG-DMA), 1,2-Dilinolenyloxy-N,N-dimethylaminopropane (DLinDMA), 2,2-Dilinoleyl-4-dimethylaminomethyl-[1,3]-dioxolane (DLin-K-DMA) or analogs thereof, (3aR,5s,6aS)-N,N-dimethyl-2,2-di((9Z,12Z)-octadeca-9,12-dienyl)tetrahydro--3aH-cyclopenta[d][1,3]dioxol-5-amine (ALNY-100), (6Z,9Z,28Z,31Z)-heptatriaconta-6,9,28,31-tetraen-19-yl 4-(dimethylamino)butanoate (MC3), or a mixture thereof.

Other cationic lipids, which carry a net positive charge at about physiological pH, in addition to those specifically described above, may also be included in lipid particles of the invention. Such cationic lipids include, but are not limited to, N,N-dioleyl-N,N-dimethylammonium chloride (“DODAC”); N-(2,3-dioleyloxy)propyl-N,N-N-triethylammonium chloride (“DOTMA”); N,N-distearyl-N,N-dimethylammonium bromide (“DDAB”); N-(2,3-dioleoyloxy)propyl)-N,N,N-trimethylammonium chloride (“DOTAP”); 1,2-Dioleyloxy-3-trimethylaminopropane chloride salt (“DOTAP.Cl”); 3.beta.-(N—(N′,N′-dimethylaminoethane)-carbamoyl)cholesterol (“DC-Chol”), N-(1-(2,3-dioleyloxy)propyl)-N-2-(sperminecarboxamido)ethyl)-N,N-dimethyl-ammonium trifluoracetate (“DOSPA”), dioctadecylamidoglycyl carboxyspermine (“DOGS”), 1,2-dileoyl-sn-3-phosphoethanolamine (“DOPE”), 1,2-dioleoyl-3-dimethylammonium propane (“DODAP”), N,N-dimethyl-2,3-dioleyloxy)propylamine (“DODMA”), and N-(1,2-dimyristyloxyprop-3-yl)-N,N-dimethyl-N-hydroxyethyl ammonium bromide (“DMRIE”), and mixtures thereof. Additionally, a number of commercial preparations of cationic lipids can be used, such as, e.g., LIPOFECTIN (including DOTMA and DOPE, available from GIBCO/BRL), and LIPOFECTAMINE (comprising DOSPA and DOPE, available from GIBCO/BRL). In particular embodiments, a cationic lipid is an amino lipid.

As used herein, the term “amino lipid” is meant to include those lipids having one or two fatty acid or fatty alkyl chains and an amino head group (including an alkylamino or dialkylamino group) that may be protonated to form a cationic lipid at physiological pH.

Other amino lipids would include those having alternative fatty acid groups and other dialkylamino groups, including those in which the alkyl substituents are different (e.g., N-ethyl-N-methylamino-, N-propyl-N-ethylamino- and the like). For those embodiments in which R¹¹ and R¹² are both long chain alkyl or acyl groups, they can be the same or different. In general, amino lipids having less saturated acyl chains are more easily sized, particularly when the complexes must be sized below about 0.3 microns, for purposes of filter sterilization. Amino lipids containing unsaturated fatty acids with carbon chain lengths in the range of C14 to C22 are preferred. Other scaffolds can also be used to separate the amino group and the fatty acid or fatty alkyl portion of the amino lipid. Suitable scaffolds are known to those of skill in the art.

Specific examples of PEG-modified lipids (or lipid-polyoxyethylene conjugates) that are useful in the present invention can have a variety of “anchoring” lipid portions to secure the PEG portion to the surface of the lipid vesicle. Examples of suitable PEG-modified lipids include PEG-modified phosphatidylethanolamine and phosphatidic acid, PEG-ceramide conjugates (e.g., PEG-CerC14 or PEG-CerC20) which are described in U.S. Pat. No. 5,820,873, incorporated herein by reference, PEG-modified dialkylamines and PEG-modified 1,2-diacyloxypropan-3-amines. Particularly preferred are PEG-modified diacylglycerols and dialkylglycerols.

Examples of suitable neutral lipid include DPSC, DPPC, POPC, DOPE, SM, and mixtures thereof.

As used herein “nucleic acid,” in its broadest sense, includes any compound and/or substance that comprise a polymer of nucleotides linked via a phosphodiester bond. Exemplary nucleic acids include ribonucleic acids (RNAs), deoxyribonucleic acids (DNAs), threose nucleic acids (TNAs), glycol nucleic acids (GNAs), peptide nucleic acids (PNAs), locked nucleic acids (LNAs) or hybrids thereof. They may also include RNAi-inducing agents, RNAi agents, siRNAs, shRNAs, miRNAs, antisense RNAs, ribozymes, catalytic DNA, tRNA, RNAs that induce triple helix formation, aptamers, vectors, etc. In some embodiments, the nucleic acid encoding the Cas protein is an mRNA. In some embodiments, the Cas protein is encoded by a modified nucleic acid (e.g., a synthetic, modified mRNA described herein).

The present invention contemplates the use of any nucleic acid modification available to the skilled artisan. The nucleic acids of the present invention can include any number of modifications. In some embodiments, the nucleic acid comprises one or more modifications selected from the group consisting of pyridin-4-one ribonucleoside, 5-aza-uridine, 2-thio-5-aza-uridine, 2-thiouridine, 4-thio-pseudouridine, 2-thio-pseudouridine, 5-hydroxyuridine, 3-methyluridine, 5-carboxymethyl-uridine, 1-carboxymethyl-pseudouridine, 5-propynyl-uridine, 1-propynyl-pseudouridine, 5-taurinomethyluridine, 1-taurinomethyl-pseudouridine, 5-taurinomethyl-2-thio-uridine, 1-taurinomethyl-4-thio-uridine, 5-methyl-uridine, 1-methyl-pseudouridine, 4-thio-1-methyl-pseudouridine, 2-thio-1-methyl-pseudouridine, 1-methyl-1-deaza-pseudouridine, 2-thio-1-methyl-1-deaza-pseudouridine, dihydrouridine, dihydropseudouridine, 2-thio-dihydrouridine, 2-thio-dihydropseudouridine, 2-methoxyuridine, 2-methoxy-4-thio-uridine, 4-methoxy-pseudouridine, 4-methoxy-2-thio-pseudouridine, 5-aza-cytidine, pseudoisocytidine, 3-methyl-cytidine, N4-acetylcytidine, 5-formylcytidine, N4-methylcytidine, 5-hydroxymethylcytidine, 1-methyl-pseudoisocytidine, pyrrolo-cytidine, pyrrolo-pseudoisocytidine, 2-thio-cytidine, 2-thio-5-methyl-cytidine, 4-thio-pseudoisocytidine, 4-thio-1-methyl-pseudoisocytidine, 4-thio-1-methyl-1-deaza-pseudoisocytidine, 1-methyl-1-deaza-pseudoisocytidine, zebularine, 5-aza-zebularine, 5-methyl-zebularine, 5-aza-2-thio-zebularine, 2-thio-zebularine, 2-methoxy-cytidine, 2-methoxy-5-methyl-cytidine, 4-methoxy-pseudoisocytidine, 4-methoxy-1-methyl-pseudoisocytidine, 2-aminopurine, 2,6-diaminopurine, 7-deaza-adenine, 7-deaza-8-aza-adenine, 7-deaza-2-aminopurine, 7-deaza-8-aza-2-aminopurine, 7-deaza-2,6-diaminopurine, 7-deaza-8-aza-2,6-diaminopurine, 1-methyladenosine, N6-methyladenosine, N6-isopentenyladenosine, N6-(cis-hydroxyisopentenyl)adenosine, 2-methylthio-N6-(cis-hydroxyisopentenyl)adenosine, N6-glycinylcarbamoyladenosine, N6-threonylcarbamoyladenosine, 2-methylthio-N6-threonyl carbamoyladenosine, N6,N6-dimethyladenosine, 7-methyladenine, 2-methylthio-adenine, and 2-methoxy-adenine, inosine, 1-methyl-inosine, wyosine, wybutosine, 7-deaza-guanosine, 7-deaza-8-aza-guanosine, 6-thio-guanosine, 6-thio-7-deaza-guanosine, 6-thio-7-deaza-8-aza-guanosine, 7-methyl-guanosine, 6-thio-7-methyl-guanosine, 7-methylinosine, 6-methoxy-guanosine, 1-methylguanosine, N2-methylguanosine, N2,N2-dimethylguanosine, 8-oxo-guanosine, 7-methyl-8-oxo-guanosine, 1-methyl-6-thio-guanosine, N2-methyl-6-thio-guanosine, and N2,N2-dimethyl-6-thio-guanosine, and combinations thereof.

Preparation of modified nucleosides and nucleotides used in the manufacture or synthesis of modified RNAs of the present invention can involve the protection and deprotection of various chemical groups. The need for protection and deprotection, and the selection of appropriate protecting groups can be readily determined by one skilled in the art.

The chemistry of protecting groups can be found, for example, in Greene, et al., Protective Groups in Organic Synthesis, 2d. Ed., Wiley & Sons, 1991, which is incorporated herein by reference in its entirety.

Modified nucleosides and nucleotides can be prepared according to the synthetic methods described in Ogata et al. Journal of Organic Chemistry 74:2585-2588, 2009; Purmal et al. Nucleic Acids Research 22(1): 72-78, 1994; Fukuhara et al. Biochemistry 1(4): 563-568, 1962; and Xu et al. Tetrahedron 48(9): 1729-1740, 1992, each of which are incorporated by reference in their entirety.

Modified nucleic acids (e.g., ribonucleic acids) need not be uniformly modified along the entire length of the molecule. Different nucleotide modifications and/or backbone structures may exist at various positions in the nucleic acid. One of ordinary skill in the art will appreciate that the nucleotide analogs or other modification(s) may be located at any position(s) of a nucleic acid such that the function of the nucleic acid is not substantially decreased. A modification may also be a 5′ or 3′ terminal modification. The nucleic acids may contain at a minimum one and at maximum 100% modified nucleotides, or any intervening percentage, such as at least 50% modified nucleotides, at least 80% modified nucleotides, or at least 90% modified nucleotides.

In some embodiments, at least one of the one to two ribonucleic acids is a modified ribonucleic acid. In some embodiments, each of the one to two ribonucleic acids is a modified ribonucleic acid. In some embodiments, at least one of the multiple ribonucleic acids is a modified ribonucleic acid. In some embodiments, a plurality of the multiple ribonucleic acids are modified. In some embodiments, each of the multiple ribonucleic acids are modified. Those skilled in the art will appreciate that the modified ribonucleic acids can include one or more of the nucleic acid modification described herein.

In some aspects, provided herein are synthetic, modified RNA molecules encoding polypeptides, where the synthetic, modified RNA molecules comprise one or more modifications, such that introducing the synthetic, modified RNA molecules to a cell results in a reduced innate immune response relative to a cell contacted with synthetic RNA molecules encoding the polypeptides not comprising the one or more modifications. In some embodiments, the Cas protein comprises a synthetic, modified RNA molecule encoding a Cas protein. In some embodiments, the Cas protein comprises a synthetic, modified RNA molecule encoding a Cas9 protein.

The synthetic, modified RNAs described herein include modifications to prevent rapid degradation by endo- and exo-nucleases and to avoid or reduce the cell's innate immune or interferon response to the RNA. Modifications include, but are not limited to, for example, (a) end modifications, e.g., 5′ end modifications (phosphorylation dephosphorylation, conjugation, inverted linkages, etc.), 3′ end modifications (conjugation, DNA nucleotides, inverted linkages, etc.), (b) base modifications, e.g., replacement with modified bases, stabilizing bases, destabilizing bases, or bases that base pair with an expanded repertoire of partners, or conjugated bases, (c) sugar modifications (e.g., at the 2′ position or 4′ position) or replacement of the sugar, as well as (d) internucleoside linkage modifications, including modification or replacement of the phosphodiester linkages. To the extent that such modifications interfere with translation (i.e., results in a reduction of 50% or more in translation relative to the lack of the modification—e.g., in a rabbit reticulocyte in vitro translation assay), the modification is not suitable for the methods and compositions described herein. Specific examples of synthetic, modified RNA compositions useful with the methods described herein include, but are not limited to, RNA molecules containing modified or non-natural internucleoside linkages. Synthetic, modified RNAs having modified internucleoside linkages include, among others, those that do not have a phosphorus atom in the internucleoside linkage. In other embodiments, the synthetic, modified RNA has a phosphorus atom in its internucleoside linkage(s).

Non-limiting examples of modified internucleoside linkages include phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3′-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3′-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3′-5′ linkages, 2′-5′ linked analogs of these, and those) having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3′-5′ to 5′-3′ or 2′-5′ to 5′-2′. Various salts, mixed salts and free acid forms are also included.

Representative U.S. patents that teach the preparation of the above phosphorus-containing linkages include, but are not limited to, U.S. Pat. Nos. 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,195; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,316; 5,550,111; 5,563,253; 5,571,799; 5,587,361; 5,625,050; 6,028,188; 6,124,445; 6,160,109; 6,169,170; 6,172,209; 6,239,265; 6,277,603; 6,326,199; 6,346,614; 6,444,423; 6,531,590; 6,534,639; 6,608,035; 6,683,167; 6,858,715; 6,867,294; 6,878,805; 7,015,315; 7,041,816; 7,273,933; 7,321,029; and U.S. Pat. RE39464, each of which is herein incorporated by reference in its entirety.

Modified internucleoside linkages that do not include a phosphorus atom therein have internucleoside linkages that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatoms and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 component parts.

Representative U.S. patents that teach the preparation of modified oligonucleosides include, but are not limited to, U.S. Pat. Nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,64,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439, each of which is herein incorporated by reference in its entirety.

Some embodiments of the synthetic, modified RNAs described herein include nucleic acids with phosphorothioate internucleoside linkages and oligonucleosides with heteroatom internucleoside linkage, and in particular —CH2-NH—CH2-, —CH2-N(CH3)-O—CH2-[known as a methylene (methylimino) or MMI], —CH2-O—N(CH3)-CH2-, —CH2-N(CH3)-N(CH3)-CH2- and —N(CH3)-CH2-CH2- [wherein the native phosphodiester internucleoside linkage is represented as —O—P—O—CH2-] of the above-referenced U.S. Pat. No. 5,489,677, and the amide backbones of the above-referenced U.S. Pat. No. 5,602,240, both of which are herein incorporated by reference in their entirety. In some embodiments, the nucleic acid sequences featured herein have morpholino backbone structures of the above-referenced U.S. Pat. No. 5,034,506, herein incorporated by reference in its entirety.

Synthetic, modified RNAs described herein can also contain one or more substituted sugar moieties. The nucleic acids featured herein can include one of the following at the 2′ position: H (deoxyribose); OH (ribose); F; O—, S—, or N-alkyl; O—, S—, or N-alkenyl; O—, S- or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl can be substituted or unsubstituted C1 to C10 alkyl or C2 to C10 alkenyl and alkynyl. Exemplary modifications include O[(CH2)nO]mCH3, O(CH2).nOCH3, O(CH2)nNH2, O(CH2)nCH3, O(CH2)nONH2, and O(CH2)nON[(CH2)nCH3)]2, where n and m are from 1 to about 10. In some embodiments, synthetic, modified RNAs include one of the following at the 2′ position: C1 to C10 lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH3, OCN, Cl, Br, CN, CF3, OCF3, SOCH3, SO2CH3, ONO2, NO2, N3, NH2, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an RNA, or a group for improving the pharmacodynamic properties of a synthetic, modified RNA, and other substituents having similar properties. In some embodiments, the modification includes a 2′ methoxyethoxy (2′-O—CH2CH2OCH3, also known as 2′-O-(2-methoxyethyl) or 2′-MOE) (Martin et al., Helv. Chim. Acta, 1995, 78:486-504) i.e., an alkoxy-alkoxy group. Another exemplary modification is 2′-dimethylaminooxyethoxy, i.e., a O(CH2)2ON(CH3)2 group, also known as 2′-DMAOE, and 2′-dimethylaminoethoxyethoxy (also known in the art as 2′-O-dimethylaminoethoxyethyl or 2′-DMAEOE), i.e., 2′-O—CH2-O—CH2-N(CH2)2.

Other modifications include 2′-methoxy (2′-OCH3), 2′-aminopropoxy (2′-OCH2CH2CH2NH2) and 2′-fluoro (2′-F). Similar modifications can also be made at other positions on the nucleic acid sequence, particularly the 3′ position of the sugar on the 3′ terminal nucleotide or in 2′-5′ linked nucleotides and the 5′ position of 5′ terminal nucleotide. A synthetic, modified RNA can also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. Representative U.S. patents that teach the preparation of such modified sugar structures include, but are not limited to, U.S. Pat. Nos. 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 5,639,873; 5,646,265; 5,658,873; 5,670,633; and 5,700,920, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference in its entirety.

As non-limiting examples, synthetic, modified RNAs described herein can include at least one modified nucleoside including a 2′-O-methyl modified nucleoside, a nucleoside comprising a 5′ phosphorothioate group, a 2′-amino-modified nucleoside, 2′-alkyl-modified nucleoside, morpholino nucleoside, a phosphoramidate or a non-natural base comprising nucleoside, or any combination thereof.

In some embodiments of this aspect and all other such aspects described herein, the at least one modified nucleoside is selected from the group consisting of 5-methylcytidine (5mC), N6-methyladenosine (m6A), 3,2′-O-dimethyluridine (m4U), 2-thiouridine (s2U), 2′ fluorouridine, pseudouridine, 2′-O-methyluridine (Urn), 2′ deoxyuridine (2′ dU), 4-thiouridine (s4U), 5-methyluridine (m5U), 2′-O-methyladenosine (m6A), N6,2′-O-dimethyladenosine (m6Am), N6,N6,2′-O-trimethyladenosine (m62Am), 2′-O-methylcytidine (Cm), 7-methylguanosine (m7G), 2′-O-methylguanosine (Gm), N2,7-dimethylguanosine (m2,7G), N2,N2,7-trimethylguanosine (m2,2,7G), and inosine (I).

Alternatively, a synthetic, modified RNA can comprise at least two modified nucleosides, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 15, at least 20 or more, up to the entire length of the nucleotide. At a minimum, a synthetic, modified RNA molecule comprising at least one modified nucleoside comprises a single nucleoside with a modification as described herein. It is not necessary for all positions in a given synthetic, modified RNA to be uniformly modified, and in fact more than one of the aforementioned modifications can be incorporated in a single synthetic, modified RNA or even at a single nucleoside within a synthetic, modified RNA. However, it is preferred, but not absolutely necessary, that each occurrence of a given nucleoside in a molecule is modified (e.g., each cytosine is a modified cytosine e.g., 5mC). However, it is also contemplated that different occurrences of the same nucleoside can be modified in a different way in a given synthetic, modified RNA molecule (e.g., some cytosines modified as 5mC, others modified as 2′-O-methylcytidine or other cytosine analog). The modifications need not be the same for each of a plurality of modified nucleosides in a synthetic, modified RNA. Furthermore, in some embodiments of the aspects described herein, a synthetic, modified RNA comprises at least two different modified nucleosides. In some such preferred embodiments of the aspects described herein, the at least two different modified nucleosides are 5-methylcytidine and pseudouridine. A synthetic, modified RNA can also contain a mixture of both modified and unmodified nucleosides.

As used herein, “unmodified” or “natural” nucleosides or nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). In some embodiments, a synthetic, modified RNA comprises at least one nucleoside (“base”) modification or substitution. Modified nucleosides include other synthetic and natural nucleobases such as inosine, xanthine, hypoxanthine, nubularine, isoguanisine, tubercidine, 2-(halo)adenine, 2-(alkyl)adenine, 2-(propyl)adenine, 2 (amino)adenine, 2-(aminoalkyl)adenine, 2 (aminopropyl)adenine, 2 (methylthio) N6 (isopentenyl)adenine, 6 (alkyl)adenine, 6 (methyl)adenine, 7 (deaza)adenine, 8 (alkenyl)adenine, 8-(alkyl)adenine, 8 (alkynyl)adenine, 8 (amino)adenine, 8-(halo)adenine, 8-(hydroxyl)adenine, 8 (thioalkyl)adenine, 8-(thiol)adenine, N6-(isopentyl)adenine, N6 (methyl)adenine, N6, N6 (dimethyl)adenine, 2-(alkyl)guanine, 2 (propyl)guanine, 6-(alkyl)guanine, 6 (methyl)guanine, 7 (alkyl)guanine, 7 (methyl)guanine, 7 (deaza)guanine, 8 (alkyl)guanine, 8-(alkenyl)guanine, 8 (alkynyl)guanine, 8-(amino)guanine, 8 (halo)guanine, 8-(hydroxyl)guanine, 8 (thioalkyl)guanine, 8-(thiol)guanine, N (methyl)guanine, 2-(thio)cytosine, 3 (deaza) 5 (aza)cytosine, 3-(alkyl)cytosine, 3 (methyl)cytosine, 5-(alkyl)cytosine, 5-(alkynyl)cytosine, 5 (halo)cytosine, 5 (methyl)cytosine, 5 (propynyl)cytosine, 5 (propynyl)cytosine, 5 (trifluoromethyl)cytosine, 6-(azo)cytosine, N4 (acetyl)cytosine, 3 (3 amino-3 carboxypropyl)uracil, 2-(thio)uracil, 5 (methyl) 2 (thio)uracil, 5 (methylaminomethyl)-2 (thio)uracil, 4-(thio)uracil, 5 (methyl) 4 (thio)uracil, 5 (methylaminomethyl)-4 (thio)uracil, 5 (methyl) 2,4 (dithio)uracil, 5 (methylaminomethyl)-2,4 (dithio)uracil, 5 (2-aminopropyl)uracil, 5-(alkyl)uracil, 5-(alkynyl)uracil, 5-(allylamino)uracil, 5 (aminoallyl)uracil, 5 (aminoalkyl)uracil, 5 (guanidiniumalkyl)uracil, 5 (1,3-diazole-1-alkyl)uracil, 5-(cyanoalkyl)uracil, 5-(dialkylaminoalkyl)uracil, 5 (dimethylaminoalkyl)uracil, 5-(halo)uracil, 5-(methoxy)uracil, uracil-5 oxyacetic acid, 5 (methoxycarbonylmethyl)-2-(thio)uracil, 5 (methoxycarbonyl-methyl)uracil, 5 (propynyl)uracil, 5 (propynyl)uracil, 5 (trifluoromethyl)uracil, 6 (azo)uracil, dihydrouracil, N3 (methyl)uracil, 5-uracil (i.e., pseudouracil), 2 (thio)pseudouracil,4 (thio)pseudouracil,2,4-(dithio)psuedouracil,5-(alkyl)pseudouracil, 5-(methyl)pseudouracil, 5-(alkyl)-2-(thio)pseudouracil, 5-(methyl)-2-(thio)pseudouracil, 5-(alkyl)-4 (thio)pseudouracil, 5-(methyl)-4 (thio)pseudouracil, 5-(alkyl)-2,4 (dithio)pseudouracil, 5-(methyl)-2,4 (dithio)pseudouracil, 1 substituted pseudouracil, 1 substituted 2(thio)-pseudouracil, 1 substituted 4 (thio)pseudouracil, 1 substituted 2,4-(dithio)pseudouracil, 1 (aminocarbonylethylenyl)-pseudouracil, 1 (aminocarbonylethylenyl)-2(thio)-pseudouracil, 1 (aminocarbonylethylenyl)-4 (thio)pseudouracil, 1 (aminocarbonylethylenyl)-2,4-(dithio)pseudouracil, 1 (aminoalkylaminocarbonylethylenyl)-pseudouracil, 1 (aminoalkylamino-carbonylethylenyl)-2(thio)-pseudouracil, 1 (aminoalkylaminocarbonylethylenyl)-4 (thio)pseudouracil, 1 (aminoalkylaminocarbonylethylenyl)-2,4-(dithio)pseudouracil, 1,3-(diaza)-2-(oxo)-phenoxazin-1-yl, 1-(aza)-2-(thio)-3-(aza)-phenoxazin-1-yl, 1,3-(diaza)-2-(oxo)-phenthiazin-1-yl, 1-(aza)-2-(thio)-3-(aza)-phenthiazin-1-yl, 7-substituted 1,3-(diaza)-2-(oxo)-phenoxazin-1-yl, 7-substituted 1-(aza)-2-(thio)-3-(aza)-phenoxazin-1-yl, 7-substituted 1,3-(diaza)-2-(oxo)-phenthiazin-1-yl, 7-substituted 1-(aza)-2-(thio)-3-(aza)-phenthiazin-1-yl, 7-(aminoalkylhydroxy)-1,3-(diaza)-2-(oxo)-phenoxazin-1-yl, 7-(aminoalkylhydroxy)-1-(aza)-2-(thio)-3-(aza)-phenoxazin-1-yl, 7-(aminoalkylhydroxy)-1,3-(diaza)-2-(oxo)-phenthiazin-1-yl, 7-(aminoalkylhydroxy)-1-(aza)-2-(thio)-3-(aza)-phenthiazin-1-yl, 7-(guanidiniumalkylhydroxy)-1,3-(diaza)-2-(oxo)-phenoxazin-1-yl, 7-(guanidiniumalkylhydroxy)-1-(aza)-2-(thio)-3-(aza)-phenoxazin-1-yl, 7-(guanidiniumalkylhydroxy)-1,3-(diaza)-2-(oxo)-phenthiazin-1-yl, 7-(guanidiniumalkylhydroxy)-1-(aza)-2-(thio)-3-(aza)-phenthiazin-1-yl, 1,3,5-(triaza)-2,6-(dioxa)-naphthalene, inosine, xanthine, hypoxanthine, nubularine, tubercidine, isoguanisine, inosinyl, 2-aza-inosinyl, 7-deaza-inosinyl, nitroimidazolyl, nitropyrazolyl, nitrobenzimidazolyl, nitroindazolyl, aminoindolyl, pyrrolopyrimidinyl, 3-(methyl)isocarbostyrilyl, 5-(methyl)isocarbostyrilyl, 3-(methyl)-7-(propynyl)isocarbostyrilyl, 7-(aza)indolyl, 6-(methyl)-7-(aza)indolyl, imidizopyridinyl, 9-(methyl)-imidizopyridinyl, pyrrolopyrizinyl, isocarbostyrilyl, 7-(propynyl)isocarbostyrilyl, propynyl-7-(aza)indolyl, 2,4,5-(trimethyl)phenyl, 4-(methyl)indolyl, 4,6-(dimethyl)indolyl, phenyl, napthalenyl, anthracenyl, phenanthracenyl, pyrenyl, stilbenzyl, tetracenyl, pentacenyl, difluorotolyl, 4-(fluoro)-6-(methyl)benzimidazole, 4-(methyl)benzimidazole, 6-(azo)thymine, 2-pyridinone, 5 nitroindole, 3 nitropyrrole, 6-(aza)pyrimidine, 2 (amino)purine, 2,6-(diamino)purine, 5 substituted pyrimidines, N2-substituted purines, N6-substituted purines, 06-substituted purines, substituted 1,2,4-triazoles, pyrrolo-pyrimidin-2-on-3-yl, 6-phenyl-pyrrolo-pyrimidin-2-on-3-yl, para-substituted-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl, ortho-substituted-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl, bis-ortho-substituted-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl, para-(aminoalkylhydroxy)-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl, ortho-(aminoalkylhydroxy)-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl, bis-ortho-(aminoalkylhydroxy)-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl, pyridopyrimidin-3-yl, 2-oxo-7-amino-pyridopyrimidin-3-yl, 2-oxo-pyridopyrimidine-3-yl, or any O-alkylated or N-alkylated derivatives thereof. Modified nucleosides also include natural bases that comprise conjugated moieties, e.g. a ligand. As discussed herein above, the RNA containing the modified nucleosides must be translatable in a host cell (i.e., does not prevent translation of the polypeptide encoded by the modified RNA). For example, transcripts containing s2U and m6A are translated poorly in rabbit reticulocyte lysates, while pseudouridine, m5U, and m5C are compatible with efficient translation. In addition, it is known in the art that 2′-fluoro-modified bases useful for increasing nuclease resistance of a transcript, leads to very inefficient translation. Translation can be assayed by one of ordinary skill in the art using e.g., a rabbit reticulocyte lysate translation assay.

Further modified nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in Modified Nucleosides in Biochemistry, Biotechnology and Medicine, Herdewijn, P. ed. Wiley-VCH, 2008; those disclosed in Int. Appl. No. PCT/US09/038,425, filed Mar. 26, 2009; those disclosed in The Concise Encyclopedia Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. L, ed. John Wiley & Sons, 1990, and those disclosed by Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613.

Representative U.S. patents that teach the preparation of certain of the above noted modified nucleobases as well as other modified nucleobases include, but are not limited to, the above noted U.S. Pat. No. 3,687,808, as well as U.S. Pat. Nos. 4,845,205; 5,130,30; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 5,457,191; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; 5,681,941; 6,015,886; 6,147,200; 6,166,197; 6,222,025; 6,235,887; 6,380,368; 6,528,640; 6,639,062; 6,617,438; 7,045,610; 7,427,672; and 7,495,088, each of which is herein incorporated by reference in its entirety, and U.S. Pat. No. 5,750,692, also herein incorporated by reference in its entirety.

Another modification for use with the synthetic, modified RNAs described herein involves chemically linking to the RNA one or more ligands, moieties or conjugates that enhance the activity, cellular distribution or cellular uptake of the RNA. Ligands can be particularly useful where, for example, a synthetic, modified RNA is administered in vivo. Such moieties include but are not limited to lipid moieties such as a cholesterol moiety (Letsinger et al., Proc. Natl. Acid. Sci. USA, 1989, 86: 6553-6556, herein incorporated by reference in its entirety), cholic acid (Manoharan et al., Biorg. Med. Chem. Let., 1994, 4:1053-1060, herein incorporated by reference in its entirety), a thioether, e.g., beryl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci., 1992, 660:306-309; Manoharan et al., Biorg. Med. Chem. Let., 1993, 3:2765-2770, each of which is herein incorporated by reference in its entirety), a thiocholesterol (Oberhauser et al., Nucl. Acids Res., 1992, 20:533-538, herein incorporated by reference in its entirety), an aliphatic chain, e.g., dodecandiol or undecyl residues (Saison-Behmoaras et al., EMBO J, 1991, 10:1111-1118; Kabanov et al., FEBS Lett., 1990, 259:327-330; Svinarchuk et al., Biochimie, 1993, 75:49-54, each of which is herein incorporated by reference in its entirety), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethyl-ammonium 1,2-di-O-hexadecyl-rac-glycero-3-phosphonate (Manoharan et al., Tetrahedron Lett., 1995, 36:3651-3654; Shea et al., Nucl. Acids Res., 1990, 18:3777-3783, each of which is herein incorporated by reference in its entirety), a polyamine or a polyethylene glycol chain (Manoharan et al., Nucleosides & Nucleotides, 1995, 14:969-973, herein incorporated by reference in its entirety), or adamantane acetic acid (Manoharan et al., Tetrahedron Lett., 1995, 36:3651-3654, herein incorporated by reference in its entirety), a palmityl moiety (Mishra et al., Biochim. Biophys. Acta, 1995, 1264:229-237, herein incorporated by reference in its entirety), or an octadecylamine or hexylamino-carbonyloxycholesterol moiety (Crooke et al., J. Pharmacol. Exp. Ther., 1996, 277:923-937, herein incorporated by reference in its entirety).

The synthetic, modified RNAs described herein can further comprise a 5′ cap. In some embodiments of the aspects described herein, the synthetic, modified RNAs comprise a 5′ cap comprising a modified guanine nucleotide that is linked to the 5′ end of an RNA molecule using a 5′-5′ triphosphate linkage. As used herein, the term “5′ cap” is also intended to encompass other 5′ cap analogs including, e.g., 5′ diguanosine cap, tetraphosphate cap analogs having a methylene-bis(phosphonate) moiety (see e.g., Rydzik, A M et al., (2009) Org Biomol Chem 7(22):4763-76), dinucleotide cap analogs having a phosphorothioate modification (see e.g., Kowalska, J. et al., (2008) RNA 14(6):1119-1131), cap analogs having a sulfur substitution for a non-bridging oxygen (see e.g., Grudzien-Nogalska, E. et al., (2007) RNA 13(10): 1745-1755), N7-benzylated dinucleoside tetraphosphate analogs (see e.g., Grudzien, E. et al., (2004) RNA 10(9):1479-1487), or anti-reverse cap analogs (see e.g., Jemielity, J. et al., (2003) RNA 9(9): 1108-1122 and Stepinski, J. et al., (2001) RNA 7(10):1486-1495). In one such embodiment, the 5′ cap analog is a 5′ diguanosine cap. In some embodiments, the synthetic, modified RNA does not comprise a 5′ triphosphate.

The 5′ cap is important for recognition and attachment of an mRNA to a ribosome to initiate translation. The 5′ cap also protects the synthetic, modified RNA from 5′ exonuclease mediated degradation. It is not an absolute requirement that a synthetic, modified RNA comprise a 5′ cap, and thus in other embodiments the synthetic, modified RNAs lack a 5′ cap. However, due to the longer half-life of synthetic, modified RNAs comprising a 5′ cap and the increased efficiency of translation, synthetic, modified RNAs comprising a 5′ cap are preferred herein.

The synthetic, modified RNAs described herein can further comprise a 5′ and/or 3′ untranslated region (UTR). Untranslated regions are regions of the RNA before the start codon (5′) and after the stop codon (3′), and are therefore not translated by the translation machinery. Modification of an RNA molecule with one or more untranslated regions can improve the stability of an mRNA, since the untranslated regions can interfere with ribonucleases and other proteins involved in RNA degradation. In addition, modification of an RNA with a 5′ and/or 3′ untranslated region can enhance translational efficiency by binding proteins that alter ribosome binding to an mRNA. Modification of an RNA with a 3′ UTR can be used to maintain a cytoplasmic localization of the RNA, permitting translation to occur in the cytoplasm of the cell. In one embodiment, the synthetic, modified RNAs described herein do not comprise a 5′ or 3′ UTR. In another embodiment, the synthetic, modified RNAs comprise either a 5′ or 3′ UTR. In another embodiment, the synthetic, modified RNAs described herein comprise both a 5′ and a 3′ UTR. In one embodiment, the 5′ and/or 3′ UTR is selected from an mRNA known to have high stability in the cell (e.g., a murine alpha-globin 3′ UTR). In some embodiments, the 5′ UTR, the 3′ UTR, or both comprise one or more modified nucleosides.

In some embodiments, the synthetic, modified RNAs described herein further comprise a Kozak sequence. The “Kozak sequence” refers to a sequence on eukaryotic mRNA having the consensus (gcc)gccRccAUGG SEQ ID NO: 1481, where R is a purine (adenine or guanine) three bases upstream of the start codon (AUG), which is followed by another ‘G’. The Kozak consensus sequence is recognized by the ribosome to initiate translation of a polypeptide. Typically, initiation occurs at the first AUG codon encountered by the translation machinery that is proximal to the 5′ end of the transcript. However, in some cases, this AUG codon can be bypassed in a process called leaky scanning. The presence of a Kozak sequence near the AUG codon will strengthen that codon as the initiating site of translation, such that translation of the correct polypeptide occurs. Furthermore, addition of a Kozak sequence to a synthetic, modified RNA will promote more efficient translation, even if there is no ambiguity regarding the start codon. Thus, in some embodiments, the synthetic, modified RNAs described herein further comprise a Kozak consensus sequence at the desired site for initiation of translation to produce the correct length polypeptide. In some such embodiments, the Kozak sequence comprises one or more modified nucleosides.

In some embodiments, the synthetic, modified RNAs described herein further comprise a “poly (A) tail”, which refers to a 3′ homopolymeric tail of adenine nucleotides, which can vary in length (e.g., at least 5 adenine nucleotides) and can be up to several hundred adenine nucleotides). The inclusion of a 3′ poly(A) tail can protect the synthetic, modified RNA from degradation in the cell, and also facilitates extra-nuclear localization to enhance translation efficiency. In some embodiments, the poly(A) tail comprises between 1 and 500 adenine nucleotides; in other embodiments the poly(A) tail comprises at least 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 110, at least 120, at least 130, at least 140, at least 150, at least 160, at least 170, at least 180, at least 190, at least 200, at least 225, at least 250, at least 275, at least 300, at least 325, at least 350, at least 375, at least 400, at least 425, at least 450, at least 475, at least 500 adenine nucleotides or more. In one embodiment, the poly(A) tail comprises between 1 and 150 adenine nucleotides. In another embodiment, the poly(A) tail comprises between 90 and 120 adenine nucleotides. In some such embodiments, the poly(A) tail comprises one or more modified nucleosides.

It is contemplated that one or more modifications to the synthetic, modified RNAs described herein permit greater stability of the synthetic, modified RNA in a cell. To the extent that such modifications permit translation and either reduce or do not exacerbate a cell's innate immune or interferon response to the synthetic, modified RNA with the modification, such modifications are specifically contemplated for use herein. Generally, the greater the stability of a synthetic, modified RNA, the more protein can be produced from that synthetic, modified RNA. Typically, the presence of AU-rich regions in mammalian mRNAs tend to destabilize transcripts, as cellular proteins are recruited to AU-rich regions to stimulate removal of the poly(A) tail of the transcript. Loss of a poly(A) tail of a synthetic, modified RNA can result in increased RNA degradation. Thus, in one embodiment, a synthetic, modified RNA as described herein does not comprise an AU-rich region. In particular, it is preferred that the 3′ UTR substantially lacks AUUUA sequence elements.

In one embodiment, a ligand alters the cellular uptake, intracellular targeting or half-life of a synthetic, modified RNA into which it is incorporated. In some embodiments a ligand provides an enhanced affinity for a selected target, e.g., molecule, cell or cell type, intracellular compartment, e.g., mitochondria, cytoplasm, peroxisome, lysosome, as, e.g., compared to a composition absent such a ligand. Preferred ligands do not interfere with expression of a polypeptide from the synthetic, modified RNA.

Ligands can include a naturally occurring substance, such as a protein (e.g., human serum albumin (HSA), low-density lipoprotein (LDL), or globulin); carbohydrate (e.g., a dextran, pullulan, chitin, chitosan, inulin, cyclodextrin or hyaluronic acid); or a lipid. The ligand can also be a recombinant or synthetic molecule, such as a synthetic polymer, e.g., a synthetic polyamino acid. Examples of polyamino acids include polylysine (PLL), poly L aspartic acid, poly L-glutamic acid, styrene-maleic acid anhydride copolymer, poly(L-lactide-co-glycolied) copolymer, divinyl ether-maleic anhydride copolymer, N-(2-hydroxypropyl)methacrylamide copolymer (HMPA), polyethylene glycol (PEG), polyvinyl alcohol (PVA), polyurethane, poly(2-ethylacryllic acid), N-isopropylacrylamide polymers, or polyphosphazine. Example of polyamines include: polyethylenimine, polylysine (PLL), spermine, spermidine, polyamine, pseudopeptide-polyamine, peptidomimetic polyamine, dendrimer polyamine, arginine, amidine, protamine, cationic lipid, cationic porphyrin, quaternary salt of a polyamine, or an alpha helical peptide.

Ligands can also include targeting groups, e.g., a cell targeting agent, (e.g., a lectin, glycoprotein, lipid or protein), or an antibody, that binds to a specified cell type such as a fibroblast cell. A targeting group can be, for example, a thyrotropin, melanotropin, lectin, glycoprotein, surfactant protein A, Mucin carbohydrate, multivalent lactose, multivalent galactose, N-acetyl-galactosamine, N-acetyl-glucosamine multivalent mannose, multivalent fucose, glycosylated polyaminoacids, multivalent galactose, transferrin, bisphosphonate, polyglutamate, polyaspartate, a lipid, cholesterol, a steroid, bile acid, folate, vitamin B12, biotin, or an RGD peptide or RGD peptide mimetic, among others.

Other examples of ligands include dyes, intercalating agents (e.g. acridines), cross-linkers (e.g. psoralene, mitomycin C), porphyrins (TPPC4, texaphyrin, Sapphyrin), polycyclic aromatic hydrocarbons (e.g., phenazine, dihydrophenazine), artificial endonucleases (e.g. EDTA), lipophilic molecules, e.g., cholesterol, cholic acid, adamantane acetic acid, 1-pyrene butyric acid, dihydrotestosterone, 1,3-Bis-O(hexadecyl)glycerol, geranyloxyhexyl group, hexadecylglycerol, borneol, menthol, 1,3-propanediol, heptadecyl group, palmitic acid, myristic acid,O3-(oleoyl)lithocholic acid, O3-(oleoyl)cholenic acid, dimethoxytrityl, or phenoxazine) and peptide conjugates (e.g., antennapedia peptide, Tat peptide), alkylating agents, amino, mercapto, PEG (e.g., PEG-40K), MPEG, [MPEG]2, polyamino, alkyl, substituted alkyl, radiolabeled markers, enzymes, haptens (e.g. biotin), and transport/absorption facilitators (e.g., aspirin, vitamin E, folic acid).

Ligands can be proteins, e.g., glycoproteins, or peptides, e.g., molecules having a specific affinity for a co-ligand, or antibodies e.g., an antibody, that binds to a specified cell type such as a fibroblast cell, or other cell useful in the production of polypeptides. Ligands can also include hormones and hormone receptors. They can also include non-peptidic species, such as lipids, lectins, carbohydrates, vitamins, cofactors, multivalent lactose, multivalent galactose, N-acetyl-galactosamine, N-acetyl-glucosamine multivalent mannose, or multivalent fucose.

The ligand can be a substance, e.g., a drug, which can increase the uptake of the synthetic, modified RNA or a composition thereof into the cell, for example, by disrupting the cell's cytoskeleton, e.g., by disrupting the cell's microtubules, microfilaments, and/or intermediate filaments. The drug can be, for example, taxol, vincristine, vinblastine, cytochalasin, nocodazole, japlakinolide, latrunculin A, phalloidin, swinholide A, indanocine, or myoservin.

One exemplary ligand is a lipid or lipid-based molecule. A lipid or lipid-based ligand can (a) increase resistance to degradation, and/or (b) increase targeting or transport into a target cell or cell membrane. A lipid based ligand can be used to modulate, e.g., binding of the modified RNA composition to a target cell.

In another aspect, the ligand is a moiety, e.g., a vitamin, which is taken up by a host cell. Exemplary vitamins include vitamin A, E, and K. Other exemplary vitamins include B vitamin, e.g., folic acid, B12, riboflavin, biotin, pyridoxal or other vitamins or nutrients taken up, for example, by cancer cells. Also included are HSA and low density lipoprotein (LDL).

In another aspect, the ligand is a cell-permeation agent, preferably a helical cell-permeation agent. Preferably, the agent is amphipathic. An exemplary agent is a peptide such as tat or antennopedia. If the agent is a peptide, it can be modified, including a peptidylmimetic, invertomers, non-peptide or pseudo-peptide linkages, and use of D-amino acids. The helical agent is preferably an alpha-helical agent, which preferably has a lipophilic and a lipophobic phase.

A “cell permeation peptide” is capable of permeating a cell, e.g., a microbial cell, such as a bacterial or fungal cell, or a mammalian cell, such as a human cell. A microbial cell-permeating peptide can be, for example, an α-helical linear peptide (e.g., LL-37 or Ceropin P1), a disulfide bond-containing peptide (e.g., α-defensin, β-defensin or bactenecin), or a peptide containing only one or two dominating amino acids (e.g., PR-39 or indolicidin). For example, a cell permeation peptide can be a bipartite amphipathic peptide, such as MPG, which is derived from the fusion peptide domain of HIV-1 gp41 and the NLS of SV40 large T antigen (Simeoni et al., Nucl. Acids Res. 31:2717-2724, 2003).

The synthetic, modified RNAs described herein can be synthesized and/or modified by methods well established in the art, such as those described in “Current Protocols in Nucleic Acid Chemistry,” Beaucage, S. L. et al. (Edrs.), John Wiley & Sons, Inc., New York, N.Y., USA, which is hereby incorporated herein by reference in its entirety. Transcription methods are described further herein in the Examples.

In one embodiment of the aspects described herein, a template for a synthetic, modified RNA is synthesized using “splint-mediated ligation,” which allows for the rapid synthesis of DNA constructs by controlled concatenation of long oligos and/or dsDNA PCR products and without the need to introduce restriction sites at the joining regions. It can be used to add generic untranslated regions (UTRs) to the coding sequences of genes during T7 template generation. Splint mediated ligation can also be used to add nuclear localization sequences to an open reading frame, and to make dominant-negative constructs with point mutations starting from a wild-type open reading frame. Briefly, single-stranded and/or denatured dsDNA components are annealed to splint oligos which bring the desired ends into conjunction, the ends are ligated by a thermostable DNA ligase and the desired constructs amplified by PCR. A synthetic, modified RNA is then synthesized from the template using an RNA polymerase in vitro. After synthesis of a synthetic, modified RNA is complete, the DNA template is removed from the transcription reaction prior to use with the methods described herein.

In some embodiments of these aspects, the synthetic, modified RNAs are further treated with an alkaline phosphatase.

One skilled in the art readily appreciates that the present invention is well adapted to carry out the objects and obtain the ends and advantages mentioned, as well as those inherent therein. The details of the description and the examples herein are representative of certain embodiments, are exemplary, and are not intended as limitations on the scope of the invention. Modifications therein and other uses will occur to those skilled in the art. These modifications are encompassed within the spirit of the invention. It will be readily apparent to a person skilled in the art that varying substitutions and modifications may be made to the invention disclosed herein without departing from the scope and spirit of the invention.

The articles “a” and “an” as used herein in the specification and in the claims, unless clearly indicated to the contrary, should be understood to include the plural referents. Claims or descriptions that include “or” between one or more members of a group are considered satisfied if one, more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process unless indicated to the contrary or otherwise evident from the context. The invention includes embodiments in which exactly one member of the group is present in, employed in, or otherwise relevant to a given product or process. The invention also includes embodiments in which more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process. Furthermore, it is to be understood that the invention provides all variations, combinations, and permutations in which one or more limitations, elements, clauses, descriptive terms, etc., from one or more of the listed claims is introduced into another claim dependent on the same base claim (or, as relevant, any other claim) unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise. It is contemplated that all embodiments described herein are applicable to all different aspects of the invention where appropriate. It is also contemplated that any of the embodiments or aspects can be freely combined with one or more other such embodiments or aspects whenever appropriate. Where elements are presented as lists, e.g., in Markush group or similar format, it is to be understood that each subgroup of the elements is also disclosed, and any element(s) can be removed from the group. It should be understood that, in general, where the invention, or aspects of the invention, is/are referred to as comprising particular elements, features, etc., certain embodiments of the invention or aspects of the invention consist, or consist essentially of, such elements, features, etc. For purposes of simplicity those embodiments have not in every case been specifically set forth in so many words herein. It should also be understood that any embodiment or aspect of the invention can be explicitly excluded from the claims, regardless of whether the specific exclusion is recited in the specification. For example, any one or more active agents, additives, ingredients, optional agents, types of organism, disorders, subjects, or combinations thereof, can be excluded.

Where the claims or description relate to a composition of matter, it is to be understood that methods of making or using the composition of matter according to any of the methods disclosed herein, and methods of using the composition of matter for any of the purposes disclosed herein are aspects of the invention, unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise. Where the claims or description relate to a method, e.g., it is to be understood that methods of making compositions useful for performing the method, and products produced according to the method, are aspects of the invention, unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise.

Where ranges are given herein, the invention includes embodiments in which the endpoints are included, embodiments in which both endpoints are excluded, and embodiments in which one endpoint is included and the other is excluded. It should be assumed that both endpoints are included unless indicated otherwise. Furthermore, it is to be understood that unless otherwise indicated or otherwise evident from the context and understanding of one of ordinary skill in the art, values that are expressed as ranges can assume any specific value or subrange within the stated ranges in different embodiments of the invention, to the tenth of the unit of the lower limit of the range, unless the context clearly dictates otherwise. It is also understood that where a series of numerical values is stated herein, the invention includes embodiments that relate analogously to any intervening value or range defined by any two values in the series, and that the lowest value may be taken as a minimum and the greatest value may be taken as a maximum. Numerical values, as used herein, include values expressed as percentages. For any embodiment of the invention in which a numerical value is prefaced by “about” or “approximately”, the invention includes an embodiment in which the exact value is recited. For any embodiment of the invention in which a numerical value is not prefaced by “about” or “approximately”, the invention includes an embodiment in which the value is prefaced by “about” or “approximately”.

Approximately” or “about” generally includes numbers that fall within a range of 1% or in some embodiments within a range of 5% of a number or in some embodiments within a range of 10% of a number in either direction (greater than or less than the number) unless otherwise stated or otherwise evident from the context (except where such number would impermissibly exceed 100% of a possible value). It should be understood that, unless clearly indicated to the contrary, in any methods claimed herein that include more than one act, the order of the acts of the method is not necessarily limited to the order in which the acts of the method are recited, but the invention includes embodiments in which the order is so limited. It should also be understood that unless otherwise indicated or evident from the context, any product or composition described herein may be considered “isolated”.

As used herein the term “comprising” or “comprises” is used in reference to compositions, methods, and respective component(s) thereof, that are essential to the invention, yet open to the inclusion of unspecified elements, whether essential or not.

As used herein the term “consisting essentially of” refers to those elements required for a given embodiment. The term permits the presence of additional elements that do not materially affect the basic and novel or functional characteristic(s) of that embodiment of the invention.

The term “consisting of” refers to compositions, methods, and respective components thereof as described herein, which are exclusive of any element not recited in that description of the embodiment.

EXAMPLES Example 1

Transcription activator-like effector nucleases (TALENs) bind as a pair around a genomic site, in which a double-strand break (DSB) is introduced by a dimer of FokI nuclease domains. The use of a TALEN genome-editing system to rapidly and efficiently generate mutant alleles of 15 different genes in human pluripotent stem cells (hPSCs) as a means of performing rigorous disease modeling was recently reported (Ding et al., Cell Stem Cell 12:238-251 (2013)); the proportions of clones bearing at least one mutant allele ranged from 2%-34%.

As described below, the relative efficacies of CRISPRs and TALENs targeting the same genomic sites in the same hPSC lines was assessed with the use of the same delivery platform described previously (Ding et al., Cell Stem Cell 12:238-251 (2013)). In the TALEN genome-editing system, the CAG promoter was used to co-translate (via a viral 2A peptide) each TALEN with green fluorescent protein (GFP) or red fluorescent protein (RFP). For CRISPRs, a human codon-optimized Cas9 gene was subcloned with a C-terminal nuclear localization signal (Mali et al., Science 339:823-826 (2013)) into the same CAG expression plasmid with GFP, and the guide RNA (gRNA) was separately expressed from a plasmid with the human U6 polymerase III promoter (Mali et al., Science 339:823-826 (2013)). The 20-nucleotide protospacer sequence for each gRNA was introduced using polymerase chain reaction (PCR)-based methods. Whether using TALENs or CRISPRs, equal amounts of the two plasmids were co-electroporated into hPSCs (either 25 μg of each plasmid, or 12.5 μg of each plasmid along with 25 μg of a DNA repair template if attempting knock-in) followed by fluorescence-activated cell sorting (FACS) after 24-48 hours, clonal expansion of single cells, and screening for mutations at the genomic target site via PCR.

gRNAs were designed matching G(N)19NGG sequences in seven loci in six genes (AKT2, CELSR2, CIITA, GLUT4, LINC00116, and SORT1) previously successfully targeted with TALENs (Ding et al., Cell Stem Cell 12:238-251 (2013)) and one additional locus in LDLR. In this system, CRISPRs consistently and substantially outperformed TALENs across loci and hPSC lines (see Table S1). The TALENs yielded clones with at least one mutant allele at efficiencies of 0%-34%, but matched CRISPRs yielded mutant clones at efficiencies of 51%-79% (Table S1). Just as with TALENs, CRISPRs produced a variety of indels of sizes ranging from one nucleotide to several dozen nucleotides in size, centered on the predicted cleavage sites, suggesting that non-homologous end-joining mutagenesis occurs in the same way regardless of whether CRISPRs or TALENs are used. Moreover, CRISPRs readily generated homozygous mutant clones (7%-25% of all clones; Table S1) as discerned by sequencing.

Knock-in of E17K mutations into AKT2 was also attempted using a 67-nucleotide single-stranded DNA oligonucleotide as previously described (Ding et al., Cell Stem Cell 12:238-251 (2013)). Although the predicted CRISPR cleavage site lay 11 and 13 nucleotides from the point mutations, respectively, the CRISPR yielded knock-in clones at a rate of 11%, whereas TALENs yielded only 1.6% (Table S1).

TABLE S1 Targeting Efficiency of CRISPRs Versus TALENs in Human Pluripotent Stem Cells CRISPRs TALENs Efficiency Efficiency Chromosome:  Efficiency (Mutants/ of  Position (Start of Cell (Mutants/Clones Clones Homozygous Gene Target Sequence) Target Sequence 

Line 

Screened)

Screened)

Mutants 4KT2 chr1 

:4 

762 

2 TCCCTTCCTGCCTCATTTCAGGTGAATACATCAAGACCTGGAGGCCA HUES 9  8.9% (17/192) (SEQ ID NO: 335) 4KT2 chr1 

:4 

762 

2 TCCCTTCCT GCC|TCATTTCAGGTGAATACATCAAGACCTGGAGGCCA HUES 9 (SEQ ID NO: 336) 6 

.6% (86/142) 12.7% (18/142) CELSR2 chr1:10 

1766 

TGCTGGCTCGGCTGCCCTGAGGTTGCTCAATCAAGCACAGGTTTCAA HUES 1 3.5% (1 

/506) (SEQ ID NO: 337) CELSR2 chr1:10 

1766 

TGCTGGCTCGGCTGCCCTGAGGTTGCTCAATCAAG|CAC AGGTTTCAA HUES 1 (SEQ ID NO: 338) 66.2% (45/68)  7.4% ( 

/ 

8) CIITA chr16:1 

2 

TAACAGCGATGCTGACCCCCTGTGCCTCTACCACTTCTATGACCAGA BJ-RiPS 12.7% (37/292) (SEQ ID NO: 339) CIITA chr16:1 

2 

CGATGCTCACCCCCTGTGCCTCTACCA CTT|CTATGACCAGATGGACC BJ-RiPS (SEQ ID NO: 340)  78.7% (96/122) 11. 

% (14/122) GLUT4 chr17:71 

66 

1 TGGTCCTTGCTGTGTTCTCTGCGGTGCTTGGCTCCCTGCAGTTTGGGTA HUES 9 33.5% (52/155) (SEQ ID NO: 341) GLUT4 chr17:71 

66 

1 TGGTCCTTGCTGTGTTCT|CTG CGGTGCTTGGCTCCCTGCAGTTTGGGTA HUES 9 (SEQ ID NO: 342)   66.6% (123/185) 24. 

% (4 

/1 

) LDLR chr19:1121 

TGGGCGACAGATGCGAAAGAAACGAGTTCCAGTGCCAAGACGGGAAA HUES 9   0% (0/568) (SEQ ID NO: 343) LDLR chr19:1121 

17 GAAACGAGTTCCAGTGCCAAGACGGGAAATGCATCTCCTAC|AAG TGG HUES 9 (SEQ ID NO: 344)  51.1% (90/176)  8.0% (14/17 

) LINC00116 chr2:11 

7 

TCAGAGAGGACACTGCAGTTGTCCGTGCTAGTAGCCTTCGCTTCTGGA HUES 9 29.5% (26/66)  (SEQ ID NO: 345) LINC00116 chr2:11 

7 

TCAGAGAGGACACTGCAGTTGTCCGTGCTAGTAGCCTTCGC|TTC TGGA HUES 9 (SEQ ID NO: 346)  57.4% (93/162) 8. 

% (14/1 

2) SORT1  chr1:1 

122 

3 TGATGATCTCAGAGGCTCAGTATCCTTGTCCTGGGTTGGAGATAGCA HUES 1 22.2% (12 

/57 

) (SEQ ID NO: 347) exon 2 SORT1  chr1:1 

122 

3 TGATGATCTCAGAGGCTCAGTATCCTTG|TCC TGGGTTGGAGATAGCA HUES 1 (SEQ ID NO: 348)   68.5% (100/146) 13.0% exon 2 (19/14 

) SORT1  chr1:1 

1 

TGGTAATTATGACTTTTGGACAGTCCAAGCTATATCGAAGGTGAGATCA HUES 9 10.9% (21/192) (SEQ ID NO: 349) exon 3 SORT1  chr1:1 

1 

TGGTAATTATGACTTTTGGACAGTCCAAGCTATA|CGA AGGTGAGATCA HUES 9 (SEQ ID NO: 350) 75.9% (14 

/195) 10.3% exon 3 (20/1 

5) AKT2 E17K chr19:4 

7 

2 

2 TCCCTTCCTGCCTCATTTCAGGTGA

TACATCAAGACCTGGAGGCCA HUES 9 1.6% (3/1 

2)

(SEQ ID NO: 351) AKT2 E17K chr19:4 

7 

2 

2

HUES 9 (SEQ ID NO: 352) 10.6% (10/94)

 1.1% (1/94)

AKT2 off-target chr5:22 

72

HUES 9 (SEQ ID NO: 353)    0% (0/142)    0% (0/142)

For TALENs, the binding sites are indicated with underlines, with the cleavage site predicted to be midway between the binding sites

 for CRISPRs, the protospacer is underlined, the NGG motif is in bold (may be on the antisense strand), and the predicted cleavage site is indicated with “|”; for the AKT2 E17K target sequence, the sites of the knock-in mutations are indicated in bold/italics; for the AKT2 off-target site, the two mismatches in the protospacer are indicated in bold/italics

HUES 1 and HUES 9 are human embryonic stem cell lines; BJ-RiPS is an induced plunpotent stem cell line

Mutants include single heterozygotes, compound heterozygotes, and homozygous mutants; TALEN data is from Table 1 of Ding et al. (2013), with the exception of LDLR

Successfully inserted E17K knock-in mutations into an AKT2 allele(s) using single-stranded DNA oligonucleotide (refer to FIG. 3 of Ding et al, 2013)

indicates data missing or illegible when filed

It is worth noting that the requirement for a G(N)19NGG target sequence somewhat limits site selection. Because either DNA strand can be targeted, a target sequence occurs on average every 32 basepairs. This is no barrier for gene knockout, where any coding sequence can be targeted, but it may present difficulties when trying to knock in or correct a mutation at a specific location. However, the requirement for a G at the start of the protospacer is dictated by the use of the U6 promoter to express the gRNA, and alternative CRISPR/Cas systems can relieve this requirement (Cong et al., Science 339:819-823 (2013)). This allows for the use of (N)20NGG target sequences, which are found on average every 8 basepairs.

In addition, the extent of CRISPR off-target effects remains to be defined and is highly sequence-dependent. Previous analyses have suggested that one-nucleotide mismatches in the first half of the protospacer are better tolerated than mismatches in second half (Jinek et al., Science 337:816-821 (2012); Cong et al., Science 339:819-823 (2013)). For the AKT2 sequence, there is a two-mismatch sequence differing at nucleotides 1 and 3, in the more “tolerant” half of the protospacer. Zero clones were obtained with mutations at this potential off-target site, as compared to 61% at the on-target site (Table 51). For one of the SORT1 sequences, use of a different human pluripotent stem cell line in which a single nucleotide polymorphism results in a one-nucleotide mismatch at the target site yielded mutant clones at an efficiency of 42%, compared to 66% in the original cell line. Thus, judicious selection of target sites is necessary to minimize systematic off-target effects; target sites with perfect-match or single-nucleotide-mismatch sequences elsewhere in the genome should be avoided.

From a practical standpoint, CRISPRs are easier to implement than TALENs. Each TALEN pair must be constructed de novo, whereas for CRISPRs the Cas9 component is fixed and the gRNA requires only swapping of the 20-nucleotide protospacer. Given this consideration and the demonstration herein of substantially increased efficiency as a result of replacing TALENs with CRISPRs in an otherwise identical system, CRISPRs appear to be a very powerful and broadly applicable tool for genome editing, particularly in a therapeutic context.

Example 2: Modified Cas9 mRNA Functions to Efficiently Introduce On-Target Mutations

The inventors generated Figment (Fgm) knockout mice by CRIPSR/Cas9 gene editing utilizing a modified Cas9 mRNA. Fgm is a coding gene within the long non-coding RNA Lnc-Rap-5 (referred to herein as Fgm (Lnc-Rap-5; see Sun et al., “Long noncoding RNAs regulate adipogenesis,” PNAS; 2013; 110(9):3387-3392, incorporated herein by reference in its entirety). The guide RNA (gRNA) sequence employed in this example was: 5′ gaggcgaaagccactagcac 3′. The modified Cas9 mRNA used in this example was made using an in vitro transcription reaction in which pseudouridine and 5-methyl-cytosine were reacted with unmodified nucleotides and randomly integrated into the resulting modified Cas9 mRNA. An exemplary protocol for generating Fgm knockout mice using CRISPR/Cas9 gene editing utilizing a modified Cas9mRNA is shown in FIG. 19A. As shown in FIG. 19A, 100 ng/μl of the resulting modified Cas9 mRNA and 50 ng/μl of the guide RNA noted above targeting Fgm (Lnc-Rap-5) were injected into 250 C57BL/6 mouse zygotes that were subsequently transferred to pseudo-pregnant mice and after weening screened for mutations by PCR. As shown in the gel pictured in FIG. 19B, PCR screening revealed 63 mutant animals out of 65. These results indicate that modified Cas9 mRNA functions in vivo to efficiently (i.e., 97% efficiency) introduce on target mutations in mammals.

Example 3: Mutational Analysis of Genome Edited Hematopoietic Stem-Progenitor Cells (HSPCs) by Target Capture Deep Sequencing

CRISPR/Cas9 has previously been shown to generate off-target mutations to varying degrees depending upon experimental setting and cell type (Cho et al., 2014; Cradick et al., 2013; Fu et al., 2013; Fu et al., 2014; Hruscha et al., 2013; Lin et al., 2014). To examine this in primary CD34⁺ HSPCs we performed target capture sequencing, of CD34⁺ HSPCs-mPB subjected to CRISPR/Cas9 CCR5-editing. Experimental design included capture of each gRNA target site (n=6) and predicted off-target sites (n=172) with expanded capture intervals of 500 base pairs flanking each site to ensure accurate detection of any genetic lesion occurring at or near the selected sites (FIGS. 17A and 20). We have previously shown that this approach can also capture structural variation breakpoints, such as translocations and inversions, in proximity to the capture site (Talkowski et al., 2011) (See supplemental text and methods for detailed description). Sorted CD34⁺ HSPCs treated with Cas9 alone or in combination with multiple single gRNA (crCCR5_A, crCCR5_B, or crCCR5_C) or dual gRNA combinations (crCCR5_A+B, crCCR5_C+D, or crCCR5_D+Q) were sequenced to a mean target coverage of 3,390× across each 23 bp gRNA sequence and PAM (range 379.6×-7,969.5×)(FIG. 17B). Analysis of the resulting data revealed highly efficacious on-target mutagenesis with a diverse array of mutated sequence variants observed in both single-gRNA and dual-gRNA treatments (FIG. 17C). As expected we detected small InDels of up to 10 bp in addition to varying single nucleotide substitutions at the predicted target sites in the single-gRNA libraries. Strikingly, in each dual-gRNA library, no fewer than 15 alternate mutant alleles were observed at either one of the gRNA sites (FIGS. 21-23). Notably, the extreme sequencing depth of our analysis permitted estimation of mutation frequency for each particular variant, including mutations that were observed in only a few hundredths of a percent of the sample sequenced (FIG. 24). Predicted deletions (i.e. deletions spanning between the two gRNA target sites) were the most common mutations observed (crCCR5_A+B: 19.95%; crCCR5_C+D: 20.45%; crCCR5_D+Q: 42.13%), while small InDels (crCCR5_A+B: 3.06%; crCCR5_C+D: 0.50%; crCCR5_D+Q: 2.95%) were also frequent (FIG. 17C). Interestingly, for two dual gRNA combinations (crCCR5_A+B and crCCR5_D+Q) we also observed inversions between the two predicted Cas9 cleavage sites (crCCR5_A+B: 3.06%; crCCR5_D+Q: 2.48%). The most efficacious dual gRNA combination crCCR5_D+Q led to mutations in approximately 48% of the captured sequence reads (FIG. 17C).

We next examined the capture sequence reads at predicted off-target sites in the genome (FIG. 20). An N-fold enrichment analysis was performed, wherein we compared the total number of non-reference sequencing reads at each predicted off-target site in gRNA treated and control (Cas9 only) samples. This analysis generated a ratio where 1.0 indicates an equivalent number of non-reference sequence reads in both treated and control samples, values less than 1.0 indicate fewer non-reference reads in treated samples, and values greater than 1.0 indicate a greater number of non-reference reads in treated samples (see supplementary materials for additional details) (FIG. 17D). Strikingly, this analysis showed that the mean enrichment of mutations at off-target sites in all the gRNA-treated samples compared to control closely conformed to the null hypothesis (i.e., 0.99-fold enrichment compared to controls) indicating that off-target mutation events were extremely rare. Indeed, statistical evaluation of all captured off-target sites yielded a single site (1/172; 0.6%) in the sample treated with gRNA crCCR5_B alone that passed multiple test correction for a statistically significant enrichment for off-target InDels in the gRNA crCCR5_B treated libraries versus control (p≤7.6×10⁻¹¹) (FIGS. 24-25). When we scrutinized the sequencing reads from the only statistically significant off-target site, which was located in the highly homologous CCR2 gene (FIG. 18A), we found that all sequence variants (36 out of 5,963 total reads) were one or two base InDels, (FIG. 18B). Of note, the other sample in which gRNA crCCR5_B was used (in combination with gRNA crCCR5_A) only 13 out of 5,339 reads supported mutation, however these events did not meet statistical significance above control or samples treated with other gRNAs (FIG. 18B, FIG. 24). Thus, off-target mutagenesis was exceedingly rare and moreover, the use of two gRNAs in combination did not increase the very low incidence of off-target mutagenesis. We also performed targeted analyses for structural variation at all sites and though we could easily detect on-target inversions in dual gRNA combination crCCR5_A+B and crCCR5_D+Q, there was no evidence for inversion or translocation at any off-target sites in any of the treatments. These data indicate that on-target mutagenesis efficiency was very high, and further that off-target mutagenesis was extremely infrequent for both single- and dual gRNA treatments.

Discussion

Our mutational analysis revealed highly efficacious mutagenesis of on-target sites in CD34⁺ HSPCs. Single gRNAs generated a range of mutations with the vast majority comprised of small InDels. In contrast, dual gRNA combinations largely led to predicted deletions through a diverse array of mutations including InDels and even inversions were detected. Importantly, we only identified one statistically significant off-target site in the highly homologous CCR2 gene, which occurred in one out of 6 experimental settings (gRNA crCCR5_B alone). Sequence analysis of gRNA crCCR5_B in comparison to the identified off-target site in CCR2 indicated that it perfectly matched in the seed region and contained 3 sequence mismatches at the 5′ end of the gRNA sequence (positions 1, 4 and 6). This data is consistent with previous studies showing that mismatches in the 5′ proximal end of the gRNA are well tolerated by Cas9 (Lin et al., 2014; Wu et al., 2014). Our data therefore supports the idea that judicious guide design is critical for minimizing off-target mutations. Of note, our very deep sequencing analysis enabled detection of the sole off-target event we describe, whereas sequence analysis performed at lower sequencing depth—such as 50× coverage that has been used in previous off-target analyses (Smith et al., 2014; Suzuki et al., 2014; Veres et al., 2014)—would have been unable to detect this event. Overall, our analysis of CRISPR/Cas9 mutational activity in CD34⁺ HSPCs revealed very high on-target mutation rates and extremely low incidence of off-target mutagenesis.

Off-Target Prediction and Capture Sequencing

Degenerate gRNA off-target sequences were predicted for each gRNA targeting CCR5 using the CRISPR Design off-target prediction tool (Hsu et al., 2013). Off-target sequences were further supplemented by alignment of each gRNA to the human genome using BOWTIE of which all results up to and including 3 mismatches were added to the total off-target list (Langmead et al., 2009). All instances of each predicted off-target sequence existent in the human genome reference build GRCh37v71 were recorded (FIG. 20). Each guide RNA target site (n=6) and predicted off-target site (n=172) was selected for capture sequencing using the Agilent SureSelectXT Target Enrichment System. Capture intervals were expanded by approximately 500 bp in both the 5′ and 3′ directions to ensure exhaustive capture of the targeted region and detection of any genetic lesion occurring at or near a predicted gRNA on- or off-target site, as we have previously shown accurate capability to detect translocations and inversions using targeted capture of probes in proximity to a rearrangement breakpoint using a CapBP procedure as described (Talkowski et al., 2011). Probes were tiled with 60-fold greater density over each predicted 23 bp on- or off-target gRNA binding site than the flanking kilobase of sequence. Isogenic CD34⁺ HSPCs-mPB were transfected with CRISPR/Cas9 plasmids (one Cas9 only-treated control group, three treatment groups transfected with a single gRNA, and three treatment groups transfected with dual gRNAs). Sorted CD34⁺ genome edited HSPCs were cultured for two weeks prior to DNA isolation. Capture libraries were prepared from DNA extracted from seven treatment groups. Capture libraries were sequenced as 101 bp paired-end reads on an Illumina HiSeq2000 platform.

NGS Data Processing and Computational Analysis

Read pairs were aligned to GRCh37v71 with Bwa-MEM v0.7.10-r789 (Li, arXiv 2013). Alignments were processed using PicardTools and SAMBLASTER (Faust and Hall, 2014). The Genome Analysis Toolkit (GATK) v3.1-1-g07a4bf8 was applied for base quality score recalibration, insertion/deletion (InDel) realignment, duplicate removal, and single nucleotide variant (SNV) and InDel discovery and genotyping per published best-practice protocols (McKenna et al, Genome Res 2010; DePristo et al, Nat Genet 2011; Van der Auwera et al, 2013). SNVs and InDels were annotated using ANNOVAR (Wang et al., 2010). Structural variants (SVs) were detected with LUMPY v0.2.5 considering both anomalous pair and split read evidence at a minimum call weight threshold of 7 and an evidence set score ≤0.05 (Layer et al., 2014). Candidate copy number variants (CNVs) were further statistically assessed by Student's t-test for a concomitant change in depth of coverage across the putative CNV. As a final exhaustive measure, each on- and off-target site was manually scrutinized in each capture library for evidence supporting predictable mutagenesis that is not detectable by the computational algorithms due to low levels of mosaicism in the sequenced population.

Evaluation of Off-Target Mutation Frequency

A statistical framework was developed to assess off-target mutational burden for each gRNA. For each off-target site (n=172), all reads with at least one nucleotide of overlap with that 23 bp off-target site were collected and their CIGAR information was tabulated into categories as follows: reads representing small InDels (CIGAR contains at least one “I” or “D”), reads potentially representative of other rearrangements (CIGAR contains at least one “S” or “H”), and reads reflecting reference sequence (CIGAR did not match either of the two former categories). Such counts were gathered at all 172 sites in all seven libraries and were further pooled to form comparison groups of “treatment” libraries (transfected gRNA matches corresponding off-target site gRNA) and “control” libraries (transfected gRNA does not match corresponding off-target site gRNA). Next, at each off-target site, relative n-fold enrichment of each read classification between treatment and control libraries was evaluated. Finally, a one-tailed Fisher's Exact Test was performed to assess the statistical significance of enrichment of variant reads in treatments versus controls at each off-target site, followed by Bonferroni correction to retain an experiment-wide significance threshold of α=0.05.

REFERENCES

-   1. Cong, L., et al., 2013. Multiplex genome engineering using     CRISPR/Cas systems. Science. 339, 819-23. -   2. Ding, Q., et al., 2013. Enhanced efficiency of human pluripotent     stem cell genome editing through replacing TALENs with CRISPRs. Cell     Stem Cell. 12, 393-4. -   3. Jinek, M., et al., 2013. RNA-programmed genome editing in human     cells. Elife. 2, e00471. -   4. Li, D., et al., 2013. Heritable gene targeting in the mouse and     rat using a CRISPR-Cas system. Nat Biotechnol. 31, 681-3. -   5. Mali, P., et al., 2013. RNA-guided human genome engineering via     Cas9. Science. 339, 823-6. -   6. Niu, Y., et al., 2014. Generation of Gene-Modified Cynomolgus     Monkey via Cas9/RNA-Mediated Gene Targeting in One-Cell Embryos.     Cell. 156, 836-43. -   7. Ran, F. A., et al., 2013. Double nicking by RNA-guided CRISPR     Cas9 for enhanced genome editing specificity. Cell. 154, 1380-9. -   8. Wang, H., et al., 2013. One-step generation of mice carrying     mutations in multiple genes by CRISPR/Cas-mediated genome     engineering. Cell. 153, 910-8. 

What is claimed is:
 1. A method for altering a target severe combined immune deficiency disease (SCID)-associated polynucleotide sequence in a cell ex vivo comprising contacting the SCID-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, wherein the target SCID-associated polynucleotide sequence is cleaved.
 2. A method for altering a target sever combined immune deficiency disease (SCID)-associated polynucleotide sequence in a cell ex vivo comprising (a) contacting the SCID-associated polynucleotide sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target SCID-associated polynucleotide sequence, wherein the target SCID-associated polynucleotide sequence is cleaved, and (b) contacting the cleaved target SCID-associated polynucleotide sequences with an exogenously introduced DNA repair template to initiate homology-directed repair of the target SCID-associated polynucleotide.
 3. A method for altering a target IL2RG gene sequence in a cell ex vivo comprising (a) contacting the IL2RG gene sequence with a clustered regularly interspaced short palindromic repeats-associated (Cas) protein and from one to two ribonucleic acids, wherein the ribonucleic acids direct Cas protein to and hybridize to a target motif of the target IL2RG gene sequence, wherein the target IL2RG gene sequence is cleaved, and (b) contacting the cleaved target IL2RG gene sequences with an exogenously introduced DNA repair template to initiate homology-directed repair of the target IL2RG gene sequence.
 4. A method according to any one of claims 1-3, wherein the Cas protein is Streptococcus pyogenes Cas9 protein or a functional portion thereof selected from the group consisting of a DNA binding domain, at least one RNA binding domain, a helicase domain, and an endonuclease domain.
 5. A method according to claim 4, wherein the Cas protein is Streptococcus pyogenes Cas9 protein.
 6. A method according to any one of claims 1-3, wherein the Cas protein is complexed with the one to two ribonucleic acids.
 7. A method according to any one of claims 1-3, wherein the target motif is G(N)₁₉NGG or (N)₂₀NGG.
 8. A method according to any one of claims 1-3, wherein the cell is selected from the group consisting of a peripheral blood cell, a stem cell, a pluripotent cell, a hematopoietic stem cell, a CD34+ cell, a CD34+ mobilized peripheral blood cell, a CD34+ cord blood cell, a CD34+ bone marrow cell, a CD34⁺CD38-Lineage-CD90⁺CD45RA⁻ cell, a primary human cell, a non-transformed human cell, and combinations thereof.
 9. A method according to any one of claims 1-3, wherein the Cas protein is encoded by a nucleic acid sequence comprising a modified nucleic acid selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5 ′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate.
 10. A method according to any one of claims 1-3, wherein at least one of the ribonucleic acids is a modified ribonucleic acid comprising one to two modified nucleotides selected from the group consisting of pseudouridine, 5-methylcytodine, 2-thio-uridine, 5-methyluridine-5′-triphosphate, 4-thiouridine-5′-triphosphate, 5,6-dihydrouridine-5′-triphosphate, and 5-azauridine-5′-triphosphate. 